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Abstract 

r~| ' Einstein's kinetic theory of the Brownian motion, based upon light water molecules 

Qh continuously bombarding a heavy pollen, provided an explanation of diffusion from the 

Newtonian mechanics. Since the discovery of quantum mechanics it has been a challenge 
to verify the emergence of diffusion from the Schrodinger equation. 

The first step in this program is to verify the linear Boltzmann equation as a certain 
scaling limit of a Schrodinger equation with random potential. In the second step, one 
considers a longer time scale that corresponds to infinitely many Boltzmann collisions. 
■ The intuition is that the Boltzmann equation then converges to a diffusive equation 

similarly to the central limit theorem for Markov processes with sufficient mixing. In 
these lecture notes (prepared for the Les Houches summer school in 2010 August) we 
present the mathematical tools to rigorously justify this intuition. The new material 
. relies on joint papers with H.-T. Yau and M. Salmhofer. 
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1 Overview of the rigorous derivations of diffusions 

The fundamental equations governing the basic laws of physics, the Newton and the Schrodinger 
equations, are time reversible and have no dissipation. It is remarkable that dissipation is nev- 
ertheless ubiquitous in nature, so that almost all macroscopic phenomenological equations are 
dissipative. The oldest such example is perhaps the equation of heat conductance found 
by Fourier. This investigation has led to the heat equation, the simplest type of diffusion 
equations: 



where u(x,t) denotes the temperature at position x G M d and time t. One key feature of the 
diffusion equations is their inhomogeneous scaling; the time scales as the square of the spatial 
distance: 



In these lectures we will explore how diffusion equations emerge from first principle physi- 
cal theories such as the classical Newtonian dynamics and the quantum Schrodinger dynamics. 
In Section [1] we give an overview of existing mathematical results on the derivation of diffu- 
sion (and related) equations. In Sections [2H1] we discuss the basic formalism and present a 
few well-known preliminary facts on stochastic, classical and quantum dynamics. An experi- 
enced reader can skip these sections. In Section [5] we introduce our main model, the random 
Schrodinger equation or the quantum Lorentz gas and its lattice version, the Anderson model. 
In Section [6] we formulate our main theorems that state that the random Schrodinger equation 
exhibits diffusion and after a certain scaling and limiting procedure it can be described by a 
heat equation. 

The remaining sections contain the sketch of the proofs. Since these proofs are quite 
long and complicated, we will not only have to omit many technical details, but even several 
essential ideas can only be mentioned very shortly. We will primarily focus on the most 
important aspect: the classification and the estimates of the so-called non-repetition Feynman 
graphs. Estimates of other Feynman graphs, such as recollision graphs and graphs with higher 
order pairings will only be discussed very superficially. 

Our new results presented in this review were obtained in collaboration with H.-T. Yau 
and M. Salmhofer. 

Acknowledgement. The author is grateful to M. Salmhofer for many suggestions to improve 
this presentation and for making his own lecture notes available from which, in particular, the 
material of Section 17.21 and 18.31 was borrowed. 

1.1 The founding fathers: Brown, Einstein, Boltzmann 

The story of diffusion starts with R. Brown in 1827 who observed almost two centuries ago that 
the motion of a wildflower pollen suspended in water was erratic [6]. He saw a picture similar 



d t u = A x u, 





time ~ (distance) 2 . 
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Figure 1: Brown's picture under the microscope 
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Figure 2: Einstein's explanation to Brown's picture 

to Fig. [I] under his microscope (the abrupt velocity changes in this picture are exaggerated to 
highlight the effect, see later for more precise explanation). He "observed many of them very 
evidently in motion" and argued that this motion was not due to water currents but "belonged 
to the particle itself" . First he thought that this activity characterizes only "living" objects 
like pollens, but later he found that small rock or glass particles follow the same pattern. He 
also noted that this apparently chaotic motion never seems to stop. 

This picture led to the kinetic explanation by A. Einstein in 1905 [15] that such a motion 
was created by the constant kicks on the relatively heavy pollen by the light water molecules. 
Independently, M. Smoluchowski jH] arrived at a similar theory. It should be noted that 
at that time even the atomic-molecular structure of matter was not universally accepted. 
Einstein's theory was strongly supported by Boltzmann's kinetic theory, which, however, was 
phenomenological and seriously debated at the time. Finally in 1908 Perrin j5T] (awarded 
the Nobel prize in 1926) experimentally verified Einstein's theory and used it, among others, 
to give a precise estimate on the Avogadro number. These experiments gave the strongest 
evidence for atoms and molecules at that time. 

We should make a remark on Boltzmann's theory although this will not be the subject of 
this lecture. Boltzmann's kinetic theory postulates that in a gas of interacting particles at 
relatively low density the collisions between the particles are statistically independent (Ansatz 
of molecular chaos). This enabled him to write up the celebrated (nonlinear) Boltzmann 
equation for the time evolution of the single particle phase space density f t (x,v). The main 
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assumption is that the collision rate is simply the product of the particle densities with the 
appropriate velocities, thus he arrived at the following equation: 



4/.(*,.)+«-v./,(*,«) = y»(«,v 1 )[fi(*,rf)/,(*^)-/,(*,«)/,(*^)]. (i.2) 

Here v, V\ is the pair of incoming velocities of the colliding particles and v', v[ are the outgoing 
velocities. In case of elastic hard balls, the pair (v',v[) is uniquely determined by (v,Vi) (plus 
a randomly chosen contact vector) due to energy and momentum conservation, for other type 
of collisions the kernel a(v, vi) describes microscopic details of the collision mechanism. The 
integration is for the random contact vector and all other variables apart from x, v, subject 
to constraints (momentum and energy conservation) encoded in o~(v,Vi). 

In addition to the highly non-trivial Ansatz of independence, this theory also assumes the 
molecular nature of gases and fluids in contrast to their more intuitive continuous description. 
Nowadays we can easily accept that gases and fluids on large scales (starting from micrometer 
scales and above - these are called macroscopic scales) can be characterized by continuous 
density functions, while on much smaller scales (nanometers and below, these are called mi- 
croscopic scales) the particle nature dominates. But at Boltzmann's time the particle picture 
was strongly debated. The ingenuity of Boltzmann's theory is that he could combine these two 
pictures. The Boltzmann equation is an equation of continuous nature (as it operates with 
density functions) but its justification (especially the determination of the collision kernel) 
follows an argument using particles. No wonder that it gave rise to so much controversy espe- 
cially before experiments were available. Moreover, Boltzmann theory implies irreversibility 
(entropy production) that was for long thought to be more a philosophical than a physical 
question. 

After this detour on Boltzmann we return to the diffusion models. Before we discuss 
individual models, let me mention the key conceptual difficulty behind all rigorous derivations 
of diffusive equations. Note that the Hamiltonian dynamics (either classical or quantum) is 
reversible and deterministic. The diffusion equation (II. ip . and also the Boltzmann equation, 
is irreversible: there is permanent loss of information along their evolution (recall that the 
heat equation is usually not solvable backward in time). Moreover, these equations exhibit a 
certain chaotic nature (Brown's picture). How can these two contradicting facts be reconciled? 

The short answer is that the loss of information is due to a scale separation and the 
integration of microscopic degrees of freedom. On the microscopic (particle) level the dynamics 
remains reversible. The continuous (fluid) equations live on the macroscopic (fluid) scale: they 
are obtained by neglecting (more precisely, integrating out) many degrees of freedom on short 
scales. Once we switch to the fluid description, the information in these degrees of freedom is 
lost forever. 

This two-level explanation foreshadows that for a rigorous mathematical description one 
would need a scale separation parameter, usually called e that describes the ratio between the 
typical microscopic and macroscopic scales. In practice this ratio is of order 

1 Angstrom s 

6 = -2 = 1(T 8 , 

1 cm 

but mathematically we will consider the e — > so-called scaling limit. 
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Once the scale separation is put on a solid mathematical ground, we should note another 
key property of the macroscopic evolution equations we will derive, namely their Markovian 
character. Both the heat equation ( 11. II) and the nonlinear Boltzmann equation (11.20 give 
the time derivative of the macroscopic density at any fixed time t in terms of the density 
at the same time only. I.e. these evolution equations express a process where the future 
state depends only on the present state; the dependence of the future on the past is only 
indirect through the present state. We will call this feature Markovian property or in short 
Markovity. Note that colliding particles do build up a memory along their evolution: the 
state of the recolliding particles will remember their previous collisions. This effect will have 
to be supressed to obtain a Markovian evolution equation, as it was already recognized by 
Boltzmann in his Ansatz. There are essentially two ways to reduce the rate of recollisions: 
either by going to a low density situation or by introducing a small coupling constant. 

Apart from the rigorous formulation of the scaling limit, we thus will have to cope with 
the main technical difficulty: controlling memory (recollision) effects. Furthermore, 
specifically in quantum models, we will have to control interference effects as well. 

Finally, we should remark that Einstein's model is simpler than Boltzmann's, as light 
particles do not interact (roughly speaking, Boltzmann's model is similar to Fig. [2] but the 
light particles can also collide with each other and not only with the heavy particle). Thus 
the verification of the Ansatz of molecular chaos from first principle Hamiltonian dynamics is 
technically easier in Einstein's model. Still, Einstein's model already addresses the key issue: 
how does diffusion emerge from Hamiltonian mechanics? 

1.2 Mathematical models of diffusion 

The first step to setup a correct mathematical model is to recognize that some stochasticity 
has to be added. Although we expect that for a "typical" initial configuration the diffusion 
equations are correct, this certainly will not hold for all initial configuration. We expect 
that after opening the door between a warm and a cold room, the temperature will equilibrate 
(thermalize) but certainly there exists a "bad" initial configuration of the participating iV ~ 
10 23 particles such that all "warm" particles will, maybe after some time, head towards the 
cold room and vice versa; i.e. the two room temperatures will be exchanged instead of 
thermalization. Such configuration are extremely rare; their measure in all reasonable sense 
goes to zero very fast as N — > oo, but nevertheless they are there and prevent us from deriving 
diffusion equations for all initial configurations. It is, however, practically hopeless to describe 
all "bad" initial configurations; they may not be recognizable by glancing at the initial state. 
The stochastic approach circumvents this problem by precisely formulating what we mean by 
saying that the "bad events are rare" without identifying them. 

The stochasticity can be added in several different ways and this largely determines the 
technical complications involved in the derivation of the diffusion. In general, more stochas- 
ticity is added, the easier the derivation is. 

The easiest is if the dynamics itself is stochastic; the typical example being the classical 
random walk and its scaling limit, the Wiener process (theorized by Wiener [47] in 1923). In 
the typical examples, the random process governing the dynamics of the system has no corre- 
lation. The whole microscopic dynamics is Markovian (in some generalizations the dynamics 
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of the one particle densities is not fully Markovian, although it typically has strong statisti- 
cal mixing properties). Of course this dynamics is not Hamiltonian (no energy conservation 
and no reversibility), and thus it is not particularly surprising that after some scaling limit 
one obtains a diffusion equation. The quantum counterpart of the classical random walk is 
a Schrddinger equation with a Markovian time dependent random potential, see [12] and the 
recent proof of quantum diffusion in this model by Kang and Schenker [32J. 

The next level of difficulty is a Hamiltonian system with a random Hamiltonian. Some 
physical data in the Hamiltonian (e.g. potential or magnetic field) is randomly selected (de- 
scribing a disordered system), but then they are frozen forever and then the Hamiltonian 
dynamics starts (this is in sharp contrast to the stochastic dynamics, where the system is 
subject to fresh random inputs along its evolution). A typical model is the Lorentz gas, where 
a single Hamiltonian particle is moving among fixed random scatterers. Recollisions with the 
same scatterer are possible and this effect potentially ruins the Markovity. Thus this model 
is usually considered in the weak coupling limit, i.e. the coupling parameter A between the 
test-particle and the random scatterers is set to converge to zero, A — > 0, and simultaneously 
a long time limit is considered. For a weak coupling, the recollision becomes a higher order 
effect and may be neglected. It turns out that if 



then one can prove a nontrivial Markovian diffusive limit dynamics (see Kesten and Papanico- 
laou |33j, extended more recently to a bit longer times by Komorowski and Ryzhik [35]). The 
relation (II .31) between the coupling and the time scale is called the van Hove limit. Similar 
supressing of the recollisions can be achieved by the low density limit (see later). 

The corresponding quantum model (Anderson model) is the main focus of the current lec- 
ture notes. We will show that in the van Hove limit the linear Boltzmann equation arises after 
an appropriate rescaling and for longer time scales the heat equation emerges. In particular, 
we describe the behavior of the quantum evolution in the presumed extended states regime 
of the Anderson model up to a certain time scale. We will use a mathematically rigorous 
perturbative approach. We mention that supersymmetric methods [16J offer a very attrac- 
tive alternative approach to study quantum diffusion, although the mathematical control of 
the resulting functional integrals is difficult. Effective models that are reminiscent to the 
saddle point approximations of the supersymmetric approach are more accessible to rigorous 
mathematics. Recently Disertori, Spencer and Zirnbauer have proved a diffusive estimate for 
the two-point correlation functions in a three dimensional supersymmetric hyperbolic sigma 
model [13] at low temperatures and localization was also established in the same model at 
high temperatures [12"] . 

The following level of difficulty is a deterministic Hamiltonian with random initial 
data of many degrees of freedom. The typical example is Einstein's model, where the 
test-particle is immersed in a heat bath. The heat bath is called ideal gas, if it is characterized 
by a Hamiltonian H^^ of non-interacting particles. The initial state is typically a temperature 
equilibrium state, exp (—(3H bath ) i.e. the initial data of the heat-bath particles are given by 
this equilibrium measure at inverse temperature (3. Again, some scaling limit is necessary to 
reduce the chance of recollisions, one can for example consider the limit m/M — > oo, where M 



t ~ A 



-2 




7 



and m are the mass of the test-particle and the heat-bath particles, respectively. In this case, 
a collision with a single light particle does not have sizable effect on the motion of the heavy 
particle (this is why the abrupt changes in velocity in Fig. [T] are exaggeration). Therefore, the 
rate of collisions has to be increased in parallel with m/M — >■ to have a sizeable total collision 
effect. Similarly to the van Hove limit, there is a natural scaling limit, where nontrivial limiting 
dynamics was proven by Diirr, Goldstein and Lebowitz [H]. On the quantum side, this model 
corresponds to a quantum particle immersed in a phonon (or possibly photon) bath. On the 
kinetic scale the Boltzmann equation was proven in [18] . Recently De Roeck and Frohlich 
[TO] (see also [TT] for an earlier toy model) have proved quantum diffusion for a related model 
in d > 4 dimensions, where the mass of the quantum particle was large and an additional 
internal degree of freedom ( "spin" ) was coupled to the heat bath to enhance the decay of time 
correlations. 

We remark that the problem becomes enormously more complicated if the heat-bath 
particles can interact among each other, i.e. if we are facing a truly interacting many- 
body dynamics. In this case there is even no need to distinguish between the tracer particle and 
the heat-bath particles, in the simplest model one just considers identical particles interacting 
via a two-body interaction and one investigates the single-particle density function f t (x,v). 
In a certain scaling limit regime, the model should lead to the nonlinear Boltzmann equation 
(11. 2p . This has only been proven in the classical model for short time by Lanford [36]. His work 
appeared in 1975 and since then nobody could extend the proof to include longer time scales. 
We mention that the complications are partly due to the fact that the nonlinear Boltzmann 
equation itself does not have a satisfactory existence theory for long times. The corresponding 
quantum model is an unsolved problem; although via an order by order expansion [5] there is 
no doubt on the validity of the nonlinear Boltzmann equation starting from a weakly coupled 
interacting many-body model, the expansion cannot be controlled up to date. Lukkarinen 
and Spohn [39] have studied a weakly nonlinear cubic Schrddinger equation with a random 
initial data (drawn from a phonon bath) near equilibrium. They proved that the space- 
time covariance of the evolved random wave function satisfies a nonlinear Boltzmann-type 
equation in the kinetic limit, for short times. While this is still a one-particle model and only 
fluctuations around equilibrium is studied, its nonlinear character could be interpreted as a 
first step towards understanding the truly many-body Boltzmann equation. 

Finally, the most challenging (classical) model is the deterministic Hamiltonian with 
a random initial data of a few degrees of freedom, the typical example being the various 
mathematical billiards. The simplest billiard is the hard-core periodic Lorentz gas (also called 
Sinai's billiard [S]), where the scatterers are arranged in a periodic lattice and a single point 
particle ( "billiard ball" ) moves among these scatterers according to the usual rules of specular 
reflections. The methods of proofs here originate more in dynamical system than in statistical 
mechanics and they have a strong geometric nature (e.g. convexity of the scatterers is heavily 
used) . 

All these models have natural quantum mechanical analogues; for the Hamiltonian systems 
they simply follow from standard quantization of the classical Hamiltonian. These models are 
summarized in the table below and they are also illustrated in Fig. [3j We note that the 
quantum analogue of the periodic Lorentz gas is ballistic due to the Bloch waves (a certain 
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diagonalization procedure similar to the Fourier transform), thus in this case the classical and 
quantum models behave differently; the quantum case being relatively trivial. 





CLASSICAL MECHANICS 


QUANTUM MECHANICS 


Stochastic dynamics 
(no memory) 


Random walk (Wiener) 


Random kick model with 
zero time corr. potential 
(Pillct, Schenkcr-Kang) 


Hamiltonian particle in a 
random environment 
(one body) 


Lorentz, 0"as' narticle 
in random scatterers 
(Kesten-Papanicolaou) 
(Komorowski-Ryzhik) 


Anderson model 

or quantum Lorentz gas 

(Spohn, Erdos-Yau, Erdos-Salmhofer-Yau 

Disertori-Spencer-Zirnbauer) 


Hamiltonian particle 
in a heat bath 
(randomness in the 
many-body data) 


Einstein's kinetic model 
(Diirr-Goldstein-Lebowitz) 


Electron in phonon 
or photon bath 

(Erdos, Erdos- Adami, Spohn-Lukkarinen 
Dc Rocck Frohlich) 


Periodic Lorentz gas 
(randomness in the 
one-body initial data) 


Sinai billiard 
(Bunimovich-Sinai) 


Ballistic 

(Bloch waves, easy) 


Many-body interacting 
Hamiltonian 


Nonlinear Boltzmann cq 
(short time: Lanford) 


Quantum NL Boltzmann 
(unsolved) 



2 Some facts on stochastic processes 

Our goal is to understand the stochastic features of quantum dynamics. A natural way to do 
it is to compare quantum dynamics with a much simpler stochastic dynamics whose properties 
are well-known or much easier to establish. In this section the most important ingredients 
of elementary stochastic processes are collected and we will introduce the Wiener process. 
Basic knowledge of probability theory (random variables, expectation, variance, independence, 
Gaussian normal random variable, characteristic function) is assumed. 

2.1 The central limit theorem 

Let Vi, i — 1, 2, . . ., be a sequence of independent, identically distributed (denoted by i.i.d. for 
brevity) random variables with zero expectation, = and finite variance, Ev 2 = a 2 . The 
values of Vi are either in M. d or Z d . In case of d > 1, the variance a 2 is actually a matrix (called 
covariance matrix), defined as a 2 h = Eu^V where v = (v ^\ v ( - 2 \ . . . ,v^). The v^s should 
be thought of as velocities or steps of a walking particle at discrete time i. For simplicity we 
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Random walk (Wiener) Lorenz gas (random scat.) 
Stochastic dynamics Random Hamiltonian, one body 




Einstein's model 

Deterministic Hamiltonian 
many body random data. 




Periodic Lorenz gas (billiard) 

Deterministic Hamiltonian 
one body random data. 



Figure 3: Microscopic models for diffusion 



will work in d = 1 dimensions, the extension to the general case is easy. A natural question: 
Where is the particle after n steps if n is big? 

Let 

n 

s„ ■■= 

8=1 

be the location of the particle after n steps. It is clear that the expectation is zero, E S n = 0, 
and the variance is 

E^ = E(5>) = = tut 2 (2.1) 

i=l i=l 

(note that independence via E v{Oj = for i ^ j has been used). 
This suggests to rescale S n and define 

v _ Sn _ J2i=l v i 



then EX n = and EX 2 = a 2 . The surprising fact is that the distribution of X n tends to a 
universal one (namely to the Gaussian normal distribution) as n — > oo, no matter what the 
initial distribution of u» was! This is the central limit theorem, a cornerstone of probability 
theory and one of the fundamental theorems of Nature. 

Theorem 2.1. Under the conditions Et>j = and E v f = a 2 < oo the distribution of the 
rescaled sum X n of the i.i.d random variables Vi converges to the normal distribution, i.e 

X n X 
10 



in the sense of probability distributions where X is a random variable distributed according to 
N(0,a 2 ). 

We recall that the density of the normal distribution in M. 1 is given by 

1 _ * 2 



and the convergence is the standard weak convergence of probability measures (or convergence 
in distribution or convergence in law): 

Definition 2.2. A sequence of random variables X n converges to the random variable X in 
distribution, X n =>- X if 

~EG(X n ) ^EG(X) 
for any continuous bounded function G : K — > R 

In particular, X n converges in distribution to N(0, a 2 ) if 



E G(X n ) -)■ J G(x)f(x)dx. 

Analogous definitions hold in higher dimensions. 

Recall that we will have two scales: a microscopic and a macroscopic one. In this example, 
the microscopic scale corresponds to one step of the walking particle and we run the process 
up to a total of n units of the microscopic time. The macroscopic time will be the natural 
time scale of the limit process and it will be kept order one even as n — > oo. Recalling that 
we introduced e < 1 as a scaling parameter, we now reformulate the central limit theorem in 
this language. 

Let T > be a macroscopic time (think of it as an unsealed, order one quantity) and let 



n := [Te 



-ii 



where [■] denotes the integer part. Let again Et>j = 0, Ev 2 = a 2 . From the central limit 
theorem it directly follows that 



[Te- 1 ] 



X £ T := J2 v i^ X ^> as £ ^ 0, (2.2) 

£ i=i 

where Xt is a normal Gaussian variable with variance Ta 2 and with density function 



Note that the normalization in ( 12. 2 p is only by y/ e~ l and not by \/n ~ vTV^, therefore the 
limit depends on T (but not on e, of course). 

Since the Gaussian function fr{X) gives the probability of finding Xt at location x, we 
have 

f T (X)dX RJ Prob{Xf is located at X + dX at time T}. 
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Note that we used macroscopic space and time coordinates, X and T. Translating this state- 
ment into the original process in microscopic coordinates we see that 

f T (X)dX « Prob{finding S n at e~V 2 (X + dX) at time n « e _1 T}. 

Note that the space and time are rescaled differently; if (X, T) denote the macroscopic 
space/time coordinates and (x,t) denote the microscopic coordinates, then 

x = e~ 1/2 X, t = s~ x T. 

This is the typical diffusive scaling, where 

time = (distance) 2 . (2.4) 

Finally, we point out that the limiting density function fr(X) satisfies the heat equation with 

diffusion coefficient a 2 : 

d T f T {X) = a 2 d 2 x f T {X). (2.5) 

In fact, the Gaussian function ( 12. 3 p is the fundamental solution to the heat equation also in 
higher dimensions. 

The emergence of the normal distribution and the heat equation is universal; note that 
apart from the variance (and the zero expectation) no other information on the distribution 
of the microscopic step v was used. The details of the microscopic dynamics are wiped out, 
and only the essential feature, the heat equation (or the normal distribution) with a certain 
diffusion coefficient, remains visible after the e — > scaling limit. 

We remark that the central limit theorem (with essentially the same proof) is valid if v^s 
have some short time correlation. Assume that Vi is a stationary sequence of random variables 
with zero mean, Efj = 0, and let 

R{i,j) := EviVj 

the correlation function. By stationarity, R(i,j) depends only on the difference, i — j, so let 
us use the notation 

R(i — j) := Ev^j 

This function is also called the velocity-velocity autocorrelation function. 

Theorem 2.3. Assume that the velocity-velocity autocorrelation function is summable, i.e. 

oo 

R{k) < oo. 

k=— oo 

Then the rescaled sums 

X c T : = £ V2 £ Vi 
i=i 

converge in distribution to a normal random variable, 

X e T ^X T , 0, 
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whose density function fr{X) satisfies the heat equation 



d T f = DAf T 

with a diffusion coefficient 

oo 

D := Yl 

k=—oo 

assuming that D is finite. This relation between the diffusion coefficient and the sum (or 
integral) of the velocity-velocity autocorrelation function is called the Green-Kubo formula. 

Finally we mention that all results remain valid in higher dimensions, d > 1. The role 
of the diffusion coefficient is taken over by a diffusion matrix defined by the correlation 
matrix of the single step distribution: 

Dij = Ev (i) v {j) 

where v = (uW,w' 2 ', . . . ,v^). The limiting heat equation ( 12. 5 p in the uncorrelated case is 
modified to 

d 

d T f T (X) = DydfiMX). 

In particular, for random vectors v with i.i.d. components the covariance matrix is constant 
D times the identity, and we obtain the usual heat equation 

d T f T (X) = DA x f T (X) 

where D is called the diffusion coefficient. 

2.2 Markov processes and their generators 

Let Xt, t > 0, be a continuous-time stochastic process, i.e. X t is a one-parameter family of 
random variables, the parameter is usually called time. The state space of the process is the 
space from where X t takes its values, in our case X t G M. d or 7L d . As usual, E will denote the 
expectation with respect to the distribution of the process. 

Definition 2.4. X t is a Markov process if for any t > t 

Dist(X t | {X s } se[0 , r] ) = Dist(X t | X T ) 

i.e. if the conditional distribution of X t conditioned on the family of events X s in times 
s G [0,r] is the same as conditioned only on the event at time r. 

In simple words it means that the state, X t , of the system at time t depends on the past 
between times [0, r] only via the state of the system at the last time of the past interval, r. 
All necessary information about the past is condensed in the last moment. 
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The process depends on its initial value, X . Let denote the expectation value assuming 
that the process started from the point x G M. d or Z d , i.e. 



E x <p(X t ) = VW(X t ) I X = x}. 



Here and in the sequel <p will denote functions on the state space, these are also called observ- 
ables. We will not specify exactly the space of observables. 

Markov processes are best described by their generators: 

Definition 2.5. The generator of the Markov process X t is an operator acting on the observ- 
ables ip and it is given by 

d 



(£*)(*) = - 



£=0+0 



E x ip{X £ ). 



This definition is a bit lousy since the function space on which £ act is not defined. 
Typically it is either the space of continuous functions or the space of L 2 -functions on the 
state space, but for both concept an extra stucture - topology or measure - is needed on 
the state space. Moreover, the generator is typically an unbounded operator, not defined on 
the whole function space but only on a dense subset. For this lecture we will leave these 
technicalities aside, but when necessary, we will think of the state space M. d , Z d equipped with 
their natural measures and for the space of functions we consider will the L 2 -space. 

The key fact is that the generator tells us everything about the Markov process itself. Let 
us demonstrate it by answering two natural questions in terms of the generator. 

Question 1: Starting from Xq = x, what is the distribution of X{i 
Answer: Let <p be an observable (function on the state space) and define 

f t (x) :=E x (p(X t ). (2.6) 

We wish to derive an evolution equation for f t : 



d t f t (x) = lim -E x E x ^{X t ) - E x <p(X t ) 

e->0 £ I 

since by the Markov property 

E x v(X t+£ ) = E x E Xe ( V (X t )), 
where X t is a new copy of the Markov process started at the (random) point X e . Thus 



dtft(x) 



lim -Ea- 

e^O e 



f t (X E ) - f t (x) 



d_ 

di 



£ = 0+0 



E x f t {X £ ) = {Cf t ){x). 



Therefore f t , defined in (12. 6p . solves the initial value problem 

dtft = Cf t with f (x) = <p(x) 
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Formally one can write the solution as 



If the observable is a Dirac delta function at a fixed point y in the state space, 

(p{x) = 5 y {x), 

then the solution to the above initial value problem is called the transition kernel of the 
Markov process and it is denoted by p t (x,y): 

d t Pt(-,y) = Cpt(-, y) with p (-,y) = 5 y . 

The intuitive meaning of the transition kernel is 

Pt(x, y)dy := Prob{After time t the process is at y + dy if it started at x at time 0}. 



Question 2: Suppose that the initial value of the process, X = x, is distributed according 
to a density function ip(x) on the state space. What is the probability that after time t the 
process is at y? 

Answer: 

gt(y) '■= Prob(X t at y after time t) = J %p(x)p t (x,y)dx. 

It is an exercise to check (at least formally), that g t solves the following initial value problem 

d t g t = C*g t with g = tp, 

where C* denotes the adjoint of C (with respect to the standard scalar product of the L 2 of 
the state space). 



2.3 Wiener process and its generator 

The Wiener process is the rigorous mathematical construction of the Brownian motion. There 
are various ways to introduce it, we choose the shortest (but not the most intuitive) definition. 

First we need the concept of the Gaussian process. We recall that the centered Gaussian 
random variables have the remarkable property that all their moments are determined by the 
covariance matrix (a random variable is called centered if its expectation is zero, EX = 0). 
If (Xi, X 2 , . . . , X2k) is a centered Gaussian vector- valued random variable, then the higher 
moments can be computed by Wick's formula from the second moments (or two-point 
correlations) 

EX 1 X 2 ...X 2k = II EX ^> ( 2 - 7 ) 

7T pairing (i,j)ew 

where the summation is over all possible (unordered) pairings of the indices {1,2,... 2k}. 

A stochastic process X t G M. d is called Gaussian, if any finite dimensional projection 
is a Gaussian vector- valued random variable, i.e. for any t± < t 2 < ■ ■ ■ < t^, the vector 
(X tl ,X t2 , . . . , X tn ) G M, dn is a Gaussian random variable. 
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Definition 2.6. A continuous Gaussian stochastic process W t = (Wf \ . . . , wi d) ) eR d ,t> 0, 
is called the d- dimensional (standard) Wiener process if it satisfies the following 

i) W = 

a) E\v t = o 

in) EW s {a) W t {b) = mm{s,t}5 ab . 

From Wick's formula it is clear that all higher moments of W t are determined. It is a 
fact that the above list of requirements determines the Wiener process uniquely. We can now 
extend the central limit theorem to processes. 

Theorem 2.7. Recall the conditions of Theorem \2.1\ The stochastic process 

ITs- 1 ] 

X T ■= ^ 1/2 E V * 

(defined already in (I2.2p ) converges in distribution (as a stochastic process) to the Wiener 
process 

X £ T =► W T 

as e — )■ . 



The proof of this theorem is not trivial. It is fairly easy to check that the moments 
converge, i.e. for any k G N d multiindex 



rk 
T 



E [X £ T f -> E W, 
and the same holds even for different times: 

E [X Tl t [X T2 ] k2 ■ ■ ■ [X Tm ] km E Wj?Wg . . . Wt, 

but this is not quite enough. The reason is that a continuous time process is a collection of 
uncountable many random variables and the joint probability measure on such a big space 
is not necessarily determined by finite dimensional projections. If the process has some ad- 
ditional regularity property, then yes. The correct notion is the stochastic equicontinuity (or 
Kolmogorov condition) which provides the necessary compactness (also called tightness in this 
context). We will not go into more details here, see any standard book on stochastic processes. 

Theorem 2.8. The Wiener process on M d is Markovian with generator 

Idea of the proof. We work in d = 1 dimension for simplicity. The Markovity can be easily 
checked from the explicit formula for the correlation function. The most important ingredient 
is that the Wiener process has independent increments: 



E (W t - W S )W U = {21 
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if u < s < t; i.e. the increment in the time interval [s,t] is independent of the past. The 
formula (12. 8 P is trivial to check from property (hi). 
Now we compute the generator using Definition 12.51 



de £=o+o 



e <p(w e ) 



d_ 

de e=o+o 



E (p(0) + (p'(0)W s + -<p"(0) W 2 e + ... 

\ 2 




Here we used that EoWe = 0, EqW 2 = e and the dots refer to irrelevant terms that are higher 
order in e. □ 

Finally we remark that one can easily define a Wiener process with any nontrivial (non- 
identity) diffusion coefficient matrix D as long as it is positive definite (just write D = A* A and 
apply the linear transformation W — > AW to the standard Wiener process). The generator 
is then 



2.4 Random jump process on the sphere S and its generator 

The state space of this process is the unit sphere S = S^ 1 in M. d . We are given a function 

<t(v,u) :SxS^R + 

which is interpreted as jump rate from the point v to u. 

The process will be parametrized by a continuous time t, but, unlike the Wiener process, 
its trajectories will not be continuous, rather piecewise constant with some jumps at a discrete 
set of times. A good intuitive picture is that the particle jumping on the unit sphere has a 
random clock (so-called exponential clock) in its pocket. The clock emits a signal with the 
standard exponential distribution, i.e. 



and when it ticks, the particle, being at location v, jumps to any other location mgS according 
to the distribution u — > o~(v, u) (with the appropriate normalization): 



i.J=l 



Probjthere is a signal at t + dt} = e dt, 



Prob{The particle from v jumps to u + du} 



o~(v, u)du 



J a(v, u)du 



The transition of the process at an infinitesimal time increment is given by 




u + du with probability ea(v t , u)du 
v t with probability 1 — e J a(v t ,u)du 



(2.9) 



up to terms of order 0(e 2 ) as e — > 0. 

Thus the generator of the process can be computed from Definition 12.51 Let 



ft(v) := E v (p(v t ), 
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assuming that the process starts from some given point vq = v and tp is an arbitrary observable. 
Then 



d t f t (v)= lim ~E v (E Ve tp(v t ) - E v <p(v t 

e->0+0 £ \ 

du a(v, u) E u (p(v t ) - E v cp(v t 
du a(v,u) [f t (u) - f t {v)], 



where we used the Markov property in the first line and the infinitesimal jump rate from (12. 9ft 
in the second line. Note that with probability 1 — eo~(v, u) we have v £ = v. Thus the generator 
of the random jump process is 

(Cf)(v):= J dua(v,u)[f(u)-f(v)]. (2.10) 

Note that it has two terms; the first term is called the gain term the second one is the loss 
term. The corresponding evolution equation for the time dependent probability density of the 
jump process, 

dtf t (v) = [du a(v,u)[f t (u) - f t (v)] (2.11) 



is called linear Boltzmann equation in velocity space. 

The elements of the state space S will be interpreted as velocities of a moving particle 
undergoing fictitious random collisions. Only the velocities are recorded. A velocity distribu- 
tion f(v) is called equilibrium distribution, if Cf = 0; it is fairly easy to see that, under some 
nondegeneracy condition on a, the equilibrium exists uniquely. 



3 Classical mechanics of a single particle 

Classical mechanics of a single particle in (^-dimensions is given by a Hamiltonian (energy 
function) defined on the phase space M. d x M. d : 

H(v,x) := ^v 2 + U{x). (3.1) 

Here x is the position, v is the momentum coordinate. For most part of this notes we will not 
distinguish between momentum and velocity, since we almost always consider the standard 
kinetic energy |v 2 . 

The Hamiltonian equation of motions is the following set of 2d coupled first order differ- 
ential equations: 

x(t) = d v H = v v(t) = -d x H= -VU(x). (3.2) 
For example, in case of the free evolution, when the potential is zero, U = 0, we have 

x(t) = x + v t (3.3) 

i.e. linear motion with a constant speed. 
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Instead of describing each point particle separately, we can think of a continuum of (non- 
interacting) single particles, described by a phase space density f(x,v). This function is 
interpreted to give the number of particles with given velocity at a given position, more 
precisely 

f(x, v)dxdv = Number of particles at x with velocity v such that (x,v) e A. 

Another interpretation of the phase space density picture is that a single particle with 
velocity vq at xo can be described by the measure 

f(x,v) = 5(X - X )5(V - V ), 

and it is then natural to consider more general measures as well. 

The system evolves with time, so does its density, i.e. we consider the time dependent 
phase space densities, f t (x, v). For example in case of a single particle governed by (13.21) with 
initial condition x(0) = x$ and v (0) = t>o, the evolution of the phase space density is given by 

ft(x, v) = 5(x - x(t))5(v - v(t)), 

where (x(t),v(t)) is the Hamiltonian trajectory computed from (13. 2\\ . 

It is easy to check that if the point particle trajectories satisfy (13. 2p . then f t satisfies the 
following evolution equation 

(d t + v- V x )f t (x, v) = VU(x) ■ V v f t (x, v), (3.4) 

which is called the Liouville equation. The left hand side is called the free streaming term. 
The solution to 

(d t + vV x )f t (x,v) = (3.5) 
is given by the linear transport solution 

ft(x,v) = f (x - vt,v), 

where fo is the phase space density at time zero. This corresponds to the free evolution (13.31) 
in the Hamiltonian particle representation. 

3.1 The linear Boltzmann equation 

The linear Boltzmann equation is a phenomenological combination of the free flight equation 
(13. 5p and the jump process on the sphere of the velocity space (12.111) 

(d t + v - V x )f t (x,v) = J a(u,v)[f t {x,u) - f t {x,v)]du. (3.6) 

Note that we actually have the adjoint of the jump process (observe that u and v are inter- 
changed compared with (I2.1ip ). so the solution determines: 

ft(x,v) = Probjthe process is at (x,v) at time t}, 
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given the initial probability density fo(x,v) (Question 2 in Section |2T2|) . 

The requirement that the velocities are constrained to the sphere corresponds to energy 
conservation. Of course there is no real Hamiltonian dynamics behind this equation: the 
jumps are stochastic. 

Notice that the free flight plus collision process standing behind the Boltzmann equation 
is actually a random walk in continuous time. In the standard random walk the particle 
"jumps" in a new random direction after every unit time. In the Boltzmann process the 
jumps occur at random times ("exponential clock"), but the effect on a long time scale is 
the same. In particular, the long time evolution of the linear Boltzmann equation is diffusion 
(Wiener process) in position space. 

The following theorem formulates this fact more precisely. We recall that to translate the 
velocity process into position space, one has to integrate the velocity, i.e. will consider 

x t = v s ds. 
Jo 

Theorem 3.1. Let Vt be a random velocity jump process given by the generator (12. lip . Then 

fT/e 

X e {T) =: e 1/2 / v t dt -> W T (in distribution,) 
Jo 

where Wt is a Wiener process with diffusion coeffient being the velocity autocorrelation 

POO 

D — R(t)dt, R(t) := Ev v t . 
Jo 

Here E is with respect to the equilibrium measure of the jump process on S d ~ x . 



4 Quantum mechanics of a single particle 

4.1 Wavefunction, Wigner transform 

The state space of a quantum particle in ^-dimensions is L 2 (M. d ) or L 2 (Z d ). Its elements 
are called L 2 -wavefunctions and they are usually denoted by if) = ip{x) G L 2 (R d ) or L 2 (Z d ). 
We assume the normalization, \\1pW2 = 1- We recall the interpretation of the wave function: 
the position space density |i/>(:e)| 2 cLe gives the probability of finding an electron at a spatial 
domain: 

|^(a;)| 2 dx = Probjthe particle's position is in A}. 
Similarly, the Fourier transform of t/>, 

$(v) := / e- iv - x tp(x)dx, 
Jm d 

determines the momentum space density: 

|V , (f)| 2 df = Probjthe particle's momentum is in A}. 
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In the lattice case, i.e. when Z d replaces M. d as the physical space, the Fourier transform 
is replaced with Fourier series. In these notes we neglect all 27r's that otherwise enter the 
definition of the Fourier transform. 

By the Heisenberg uncertainty principle, one cannot simultaneously determine the position 
and the momentum of a quantum particle, thus the concept of classical phase space density 
does not generalize directly to quantum mechanics. Nevertheless one can define a substitute for 
it, namely the Wigner transform. For any L 2 -wavefunction ip we define the Wigner transform 
of ip as 

W<,(x,v):= J^(x + ^(x-^)e^dz, 

and we still interpret it as "quantum phase space density" . 

It is easy to check that is always real but in general is not positive (thus it cannot be 
the density of a positive measure - in coincidence with the Heisenberg principle). However, its 
marginals reconstruct the position and momentum space densities, as the following formulas 
can be easily checked: 

J W4x,v)dv = mx)\ 2 , J W^(x,v)dx = mv)\ 2 . 

In particular, for normalized wave functions \\1pW2 = 1, we have 

Wj,(x,v)dvdx = l. (4.1) 



// 



We remark, that for the lattice case some care is needed for the proper definition of the 
Wigner transform, since x ± | may not be a lattice site. The precise definition in this case is 

W4x,v):= e^-^WW, (4.2) 

y,z£Z d 
y-\-z = 2x 

where tp G £ 2 (Z d ), i.e. y,z G Z d , but x G (Z/2) d . The formulas for marginal of the Wigner 
transform modify as follows: 

W^(x,v)dv = 2 d \^(x)\ d 
if x G Z d and it is zero if x G (Z/2) d \ Z d . We still have 

/ W4x,v)dx = 2- d J2 W+(x,v) = \$( 

x€ (Z/2J 

and 

dx / dvW^(x, v ) - 1 ■ 



v)\ 2 



(z/2)' 

Often we will use the Wigner transform in the Fourier representation, by which we will 
always mean Fourier transform in the first (x) variable only, i.e. with the convention 



/(0 = J e- ix *f(x)dx, 
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we define 

After a simple calculation, we have 



W*(t,v) = $(y- £)#(« + £). (4-3) 
In the discrete case we have £ G R / (2 • 2nTL d ) and 



More generally, if J(x, v) is a classical phase space observable, the scalar product 

(J,Wip) = / J(x,v)Wip(x, v)dxdv 



can be interpreted as the expected value of J in state ip. Recall that "honest" quantum 
mechanical observables are self-adjoint operators O on L 2 (IR d ) and their expected value is 
given by 

{ip,Oif)) = I 4){x) {Oip){x)dx. 



For a large class of observables there is a natural relation between observables O and their 
phase space representations (called symbols) that are functions on the phase space like J(x, v). 
For example, if J depends only on x or only on v, then the corresponding operator is just the 
standard quantization of J, i.e. 



J(x)W^(x,v)dxdv = (ip, Jip) 
where J is a multiplication operator on the right hand side, or 

J(v)W i) (x,v)dxdv = (V>,J(-iV)V>), 



and similar relations hold for the Weyl quantization of any symbol J(x,v). 

We also remark that the map ip — > is invertible, i.e. one can fully reconstruct the 
wave function from its Wigner transform. On the other hand, not every real function of two 
variables (x, v) is the Wigner transform of some wavefunction. 

4.2 Hamilton operator and the Schrodinger equation 

The quantum dynamics is generated by the Hamilton operator 

H = -~A X + U(x) 



acting on ip G L (R ). The first term is interpreted as the kinetic energy and it is the quan 

I 

2 



tization of the classical kinetic energy ^v 2 (compare with (13.11) ). The momentum operator is 
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Figure 4: Evolution of slower (left) and a faster (right) wave packet 



p = — iV and we set the mass equal one, so momentum and velocity coincide. [To be pedantic, 
we should use the word momentum everywhere instead of velocity; since the momentum is the 
canonical quantity, then the dispersion relation, e(p), defines the kinetic energy as a function 
of the momentum operator, and its gradient, v = Ve(p) is the physical velocity Since we will 
mostly use the dispersion relation e(p) = \p 2 ! , the momentum and velocity coincide. However, 
for the lattice model the velocity and the momentum will differ.] 
The evolution of a state is given by the Schrodinger equation 

idti/H = Hif> t = (-^A + C/)^ 
with a given initial data ipt=o — i>o- Formally, the solution is 

^ t = e- itH i> . 

If H is self-adjoint in L 2 , then the unitary group e~ ltH can be defined by spectral the- 
orem. Almost all Hamilton operators in mathematical physics are self-adjoint, however the 
self-adjointness usually requires some proof. We will neglect this issue here, but we only men- 
tion that self-adjointness is more than the symmetry of the operator, because H is typically 
unbounded when issues about the proper domain of the operator become relevant. 

Note the complex i in the Schrodinger equation, it plays an absolutely crucial role. It is 
responsible for the wave-like character of quantum mechanics. The quantum evolution are 
governed by phase and dispersion. Instead of giving precise explanations, look at Fig. HJ the 
faster the wave oscillates, the faster it moves. 

We can also justify this picture by some sketchy calculation (all these can be made rigorous 
in the so-called semiclassical regime). We demonstrate that the free (U = 0) evolution of 

ip (x) := e iV0X A(x - x Q ) 

(with some fixed Xo, vq) after time t is supported around xq + vot. Here we tacitly assume 
that A is not an oscillatory function (e.g. positive) and the only oscillation is given explicitly 
in the phase e lv ° x , mimicking a plane wave. 
We compute the evolution of ipo'- 

ijj t {x)= [ e iv{x - y) e- itv2/2 e iV0V A{y-x ) Ay&v 



-■4>o(y) 

e iv x I e iv( X -x ) e -itvy2^ v _ ^ dy _ / e i*(v)£( v _ dv 
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Figure 5: Semiclassical setup: short wavelength, large scale envelope and potential 



with a phase factor 

$(f ) := v (x — x ) tv 2 . 

2 

We apply the stationary phase argument: the integral is concentrated around the stationary 
points of the phase. Thus 

= V„$ — x - x - tv 

gives x = xo + v t pa xo + vot if A is sufficiently localized. 

This argument can be made rigorous if a scale separation ansatz is used. We introduce a 
small parameter e and assume that 

Mx)=£ d/2 e lVoX A(e(x-x )), 

i.e. the wave has a slowly varying envelope (amplitude) and fast phase. The prefactor is chosen 
to keep the normalization ||-?/>o|| = 1 independently of e. Such states and their generalizations 
of the form 

£ d/2 e iS(ex)/e A ( £ ^ _ ^ ) (44) 

are called W KB -states. These states are characterized by a slowly varying amplitude and 
wavelength profile and by a fast oscillation. Similar picture holds if the potential is nonzero, 
but slowly varying, i.e. U(x) is replaced with U(ex). 



4.3 Semiclassics 

The WKB states and the rescaled potential belong to the semiclassical theory of quantum 
mechanics. A quantum mechanical system is in the semiclassical regime whenever all the data 
(potential, magnetic field, initial profile etc.) live on a much bigger scale than the quantum 
wavelength. 

The scale separation parameter is usually explicitly included in the Schrodinger equation, 
for example: 

i^(i)= [-^A x + U(ex)^ t (x). (4.5) 

This equation is written in microscopic coordinates. It becomes more familiar if we rewrite it 
in macroscopic coordinates 

(X,T) = (xe,te), 
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where (14. 5 p takes the form 

r e 2 i 
ied T ^ T {X) = [ - —A x + U(X)\ V T {X). 

Traditionally e is denoted h in this equation and is referred to the (effective) Planck constant, 
whose smallness in standard units justifies taking the limit h — > (the so-called semiclassical 
limit). No matter in what coordinates and units, the essential signature of the semiclassical 
regime is the scale separation between the wavelength and any other lengthscales. 

Wigner transform W^(x,v), written originally in microscopic coordinates, can also be 
rescaled: 

W;(X,V) :=e- d Wi,(j,Vy 

Note that apart from the rescaling of the X variable (V variable is unsealed!), a normalizing 
prefactor e~ d is added to keep the integral of W e normalized (see (14. ip ) 



J J W;(X,V)dXdV 



1. 



The following theorem is a simple prototype of the theorems we wish to derive in more 
complicated situations. It shows how the quantum mechanical Schrodinger evolution can be 
approximated by a classical equation in a certain limiting regime: 

Theorem 4.1. Consider a sequence of initial states ipQ whose rescaled Wigner transform has 
a weak limit: 

hmW; f) (X,V)^W (X,V). 

For example, the WKB states given in (14. 4 p have a weak limit 

W (X,V) = \A(X)\ 2 S(V -vspo). 

Let denote the solution to the semiclassical Schrodinger equation (14. 5 p with initial state 
i[)q . Then the weak limit of the rescaled Wigner transform of the solution at time t = T/e, 



W T {X, V) := lim WL (X,V) 

satisfies the Liouville equation: 

(d T + V- Vx)W T (X, V) = VU(X) ■ VvWt(X, V). (4.6) 

Recall that the Liouville equation is equivalent to the classical Hamilton dynamics (see the 
derivation of (13. 4p ). 

In this way, the Liouville equation (14. 6 p mimics the semiclassical quantum evolution on 
large scales (for a proof, see e.g. [10]). Notice that weak limit is taken; this means that the 
small scale structure of the Wigner transform is lost. Before the weak limit, the Wigner 
transform of carries all information about r/>^, but the weak limit means testing it against 
functions that live on the macroscopic scale. This phenomenon will later be essential to 
understand why irreversibility of the Boltzmann equation does not contradict the reversibility 
of Schrodinger dynamics. 
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5 Random Schrodinger operators 



5.1 Quantum Lorentz gas or the Anderson model 

Now we introduce our basic model, the quantum Lorentz gas. Physically, we wish to model 
electron transport in a disordered medium. We assume that electrons are non-interacting, 
thus we can focus on the evolution of a single electron. The disordered medium is modelled 
by a potential describing random impurities (other models are also possible, e.g. one could 
include random perturbations in the kinetic energy term). This can be viewed as the quantum 
analogue of the classical Lorentz gas, where a classical particle is moving among random 
obstacles. 

Definition 5.1. The model of a single particle described by the Schrodinger equation on R d 
or on Z d , 

idtMx) = H-hix), H = -A x + XV u (x), 
is called the quantum Lorentz gas, ifV u (x) is a random potential at location x. 

Notation: The subscript u indicates that V(x) is a random variable that also depends on 
an element u in the probability space. 

More generally, we can consider 

H = H + \V, (5.1) 

where H is a deterministic Hamiltonian, typically H = —A. We remark that a periodic 
background potential can be added, i.e. 

H = -A x + U per (x) + \V w {x) 

can be investigated with similar methods. We recall that the operator with a periodic potential 
and no randomness, 

H = -A x + U per (x), 

can be more or less explicitly diagonalized by using the theory of Bloch waves. The transport 
properties are similar to those of the free Laplacian. This is called the periodic quantum Lorentz 
gas and it is the quantum analogue of the Sinai billiard. However, while the dynamics of the 
Sinai billiard has a very complicated structure, its quantum version is quite straightforward 
(the theory of Bloch waves is fairly simple). 

We also remark that for the physical content of the model, it is practically unimportant 
whether we work on W d or on its lattice approximation Z d . The reason is that we are investi- 
gating long time, large distance phenomenon; the short scale structure of the space does not 
matter. However, technically Z d is much harder (a bit unusual, since typically M. d is harder as 
one has to deal with the ultraviolet regime). If one works on Z d , then the Laplace operator is 
interpreted as the discrete Laplace operator on Z d , i.e. 

(A/)(x):=2d/(x)-X;/(z + e). (5.2) 

|e|=l 
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In Fourier space this corresponds to the dispersion relation 

d 

e(p) = - cosp (j) ) (5.3) 

3=1 

on the torus p G [— 7T, 7i"] d . 

The random potential can be fairly arbitrary, the only important requirement is that it 
cannot have a long-distance correlation in space. For convenience, we will assume i.i.d. random 
potential for the lattice case with a standard normalization: 

{V(x) : xeZ d } i.i.d EV(x) = 0, EV 2 (x) = 1. 

The coupling constant A can then be used to adjust the strength of the randomness. We can 
also write this random potential as 

V(x)=^v a 6(x-a), (5.4) 

a£Z d 

where {v a : a G Z d } is a collection of i.i.d. random variables and 5 is the usual lattice delta 
function. 

For continuous models, the random potential can, for example, be given as follows; 



V(x) = V <* B ( X - 



a) 



where B is a nice (smooth, compactly supported) single site potential profile and {v a : a G 
Z d } is a collection of i.i.d. random variables. It is also possible to let the randomness perturb 
the location of the obstacles instead their strength, i.e. 

v(x)= J2 B ( x -v«(")l 

where, for example, y a {u) is a random point in the unit cell around a G Z , or even more 
canonically, the collection {y a {uj)} a is just a Poisson point process in R d . The combination of 
these two models is also possible and meaningful, actually in our work [21] we consider 

V(x) = j B(x-y)d l 2 UJ (y) (5.5) 

where /j, u is a Poisson point process with homogeneous unit density and with i.i.d. coupling 
constants (weights), i.e. 

^ = ^2v a (u)6 ya{u)) , (5.6) 

a 

where {y a (oj) : a = 1, 2, . . .} is a realization of the Poission point process and {v a } are 
independent (from each other and from y a as well) real valued random variables. 

The lattice model — A + XV with i.i.d. random potentials (15.41) is called Anderson model. It 
was invented by Anderson [4] who was the first to realize that electrons move quite differently 
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in disordered media than in free space or in a periodic background. The main phenomenon 
Anderson discovered was the Anderson localization, asserting that at sufficiently strong dis- 
order (or in d — 1,2 at any nonzero disorder) the electron transport stops. We will explain 
this in more details later, here we just remark that Anderson was awarded the Nobel Prize in 
1977 for this work. 

However, in d > 3 dimensions electron transport is expected despite the disorder if the 
disorder is weak. This is actually what we experience in real life; the eletric wire is conduct- 
ing although it does not have a perfect periodic lattice structure. However, the nature of 
the electric transport changes: we expect that on large scales it can be described by a diffu- 
sive equation (like the heat equation) instead of a wave-type equation (like the Schrodinger 
equation) . 

Note that the free Schrodinger equation is ballistic (due to wave coherence). This means 
that if we define the mean square displacement by the expectation value of the observable x 2 
at state if) t , 



for large t ^> 1 as it can be easily computed. In contrast, the mean square displacement for 
the heat equation scales as t and not as t 2 (see (12.4j) ). 

Thus the long time transport of the free Schrodinger equation is very different from the 
heat equation. We nevertheless claim, that in a weakly disordered medium, the long time 
Schrodinger evolution can be described by a diffusion (heat) equation. Our results answer to 
the intriguing question how the diffusive character emerges from a wave-type equation. 

5.2 Known results about the Anderson model 

We will not give a full account of all known mathematical results on the Anderson model, 
we just mention the main points. The main issue is to study the dichotomic nature of the 
Anderson model, namely that at low dimension (d = 1,2) or at high disorder (A > Xo(d)) 
or near the spectral edges of the unperturbed operator H the time evolved state remains 
localized, while at high dimension (d > 3), at low disorder and away from the spectral edges 
it is delocalized. 

There are several signatures of (de)localization and each of them can be used for rigorous 
definition. We list three approaches: 

i) Spectral approach. If H has pure point (PP) spectrum then the system is in the localized 
regime, if H has absolutely continuous (AC) spectrum then it is in the delocalized regime. 
(The singular continuous spectrum, if appears at all, gives rise to anomalous diffusions). 
It is to be noted that even in the pure point regime the spectrum is dense. 

ii) Dynamical approach. One considers the mean square displacement (15. 7p and the system 
is in the localized regime if 




(5.7) 



then it behaves as 





t>o 



(other observables can also be considered). 
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iii) Conductor or Insulator? This is the approach closest to physics and mathematically it 
has not been sufficiently elaborated (for a physical review, see [37]). In this approach 
one imposes an external voltage to the system and computes the currect and the ohmic 
resistance. 

These criteria to distinguish between localized and delocalized regimes are not fully equiv- 
alent (especially in d = 1 dimensional systems there are many anomalous phenomena due to 
the singular continuous spectrum), but we do not go into more details. 

The mathematically most studied approach is the spectral method. The main results are 
the following: 

i) In d = 1 dimensions all eigenfunctions are localized for all A ^ 0. (Goldsheid, Molchanov 
and Pastur [3~T] ) 

ii) In d > 1 localization occurs for large A or for energies near the edges of the spectrum 
of Hq (see ( 15. ip ). This was first proven by the groundbreaking work of Frohlich and 
Spencer [28] on the exponential decay of the resolvent via the multiscale method, (the 
absence of AC spectrum was then proved by Frohlich, Martinelli, Spencer and Scoppola 
[27] and the exponential localization by Simon and Wolff [43J). Later a different method 
was found by Aizenman and Molchanov [2] (fractional power method). The spectrum is 
not only pure point, but the eigenfunctions decay exponentially (strong localization) . 

iii) Many more localization results were obtained by variants of these methods for various 
models, including magnetic field, acoustic waves, localized states along sample edges etc. 
We will not attempt to give a list of references here. 

Common in all localization results is that the random potential dominates, i.e. in one way 
or another the free evolution H is treated as a perturbation. 

It is important to understand that the transport in the free Schrodinger equation is due to 
the coherence of the travelling wave. Random potential destroys this coherence; the stronger 
the randomness is, the surer is the destruction. This picture is essential to understand why 
random potentials may lead to localization even at energies that belong to the regime of 
classical transport. For example in d = 1 any small randomness stops transport, although the 
classical dynamics is still ballistic at energies that are higher than the maximal peak of the 
potential. In other words, Anderson localization is a truly quantum phenomenon, it cannot 
be explained by a heuristics based upon classically trapped particles. 

5.3 Major open conjectures about the Anderson model 

All the previous results were for the localization regime, where one of the two main methods 
(multiscale analysis or fractional power method) is applicable. The complementary regimes 
remain unproven. Most notably is the following list of open questions and conjectures: 

i) [Extended states conjecture] For small A < Xo(d) and in dimension d > 3 the 
spectrum is absolutely continuous away from the edges of H . In particular, there exists 
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a threshold value (called mobility edge) near the edges of the unperturbed spectrum that 
separates the two spectral types. 

This conjecture has been proven only in the following three cases: 

a) Bethe lattice (infinite binary tree) that roughly corresponds to d — oo (Klein [31] , 
recently different proofs were obtained in [3] and |26j). 

b) Sufficiently decaying random potential, E V 2 = \x\~ a , for a > 1 (Bourgain [Tj). 
The decay is sufficiently strong such that the typical number of collisions with 
obstacles is finite. Note that the randomness is not stationary. 

c) In d — 2 with a constant magnetic field the spectrum cannot be everywhere pure 
point. (Germinet, Klein and Schenker [29|) 

ii) [Two dimensional localization] In d = 2 dimensions all eigenfunctions are localized 
for all A. (i.e. the model in d = 2 dimensions behaves as in d = 1). 

iii) [Quantum Brownian motion conjecture] For small A and d > 3, the location of the 
electron is governed by a heat equation in a vague sense: 

dMx)\ 2 ~ A x |^(x)| 2 {x 2 ) t ~t t»l. (5.8) 

The precise formulation of the first statement requires a scaling limit. The second 
statement about the diffusive mean square displacement is mathematically precise, but 
what really stands behind it is a diffusive equation that on large scales mimics the 
Schrodinger evolution. Moreover, the dynamics of the quantum particle converges to 
the Brownian motion as a process as well; this means that the joint distribution of 
the quantum densities |^(x)| 2 at different times t\ < t 2 < ■ ■ ■ < t n converges to the 
corresponding finite dimensional marginals of the Wiener process. 

Note that the "Quantum Brownian motion conjecture" is much stronger than the "Ex- 
tended states conjecture", since the former more precisely describes how the states extend. 
All these three open conjectures have been outstanding for many years and we seem to be far 
from their complete solutions. 

Fig. [6] depicts the expected phase diagram of the Anderson model in dimensions d > 3. The 
picture shows the different spectral types (on the horizontal energy axis) at various disorder. 
The grey regimes indicate what has actually been proven. 

Our main result, explained in the next sections, is that the "Quantum Brownian motion 
conjecture" in the sense of (15.81) holds in the scaling limit up to times t ~ A -2 ~ K . More pre- 
cisely, we will fix a positive number k and we will consider the family of random Hamiltonians 

H = H X = ~A + XV 

parametrized by A. We consider their long time evolution up to times t ~ A~ 2 ~ K and we take 
A — > limit. We show that, after appropriately rescling the space and time, the heat equation 
emerges. Note that the time scale depends on the coupling parameter. This result is of course 



30 



X 

PP proven 




PP spectrum ?? 



AC spec trum 



-2d 




PP proven 



2d energy 



Expected spectrum at small, nonzero disorder 0< X « 1 



PP 



AC ?? 



PP 



PP proven 



mobility edge 



Figure 6: Phase diagram of the Anderson model in d = 3 

far from either of the conjectures i) and iii) above, since those conjectures require fixing A 
(maybe small but fixed) and letting t — > oo. 

We emphasize that the typical number of collisions is X 2 t. The reason is that the quantum 
rate of collision with a potential that scales as 0(A) is of order A 2 . This follows from simple 
scattering theory: if 

H = -A + XV 

is a Hamiltonian with a single bump potential (i.e. Vo is smooth, compactly supported) and 
ipin denotes the incoming wave, then after scattering the wave splits into two parts; 



t > 1. 



(5.9) 



Here ip sc (t) is the scattered wave (typically spherical if Vo is spherical) while the first term 
describes the transmitted wave that did not notice the obstacle (apart from its amplitude 
is scaled down by a factor /?). Elementary calculation then shows that the scattered and 
transmitted waves are (almost) orthogonal and their amplitudes satisfy 



||lMf)|| a = 0(A a ), /3 2 = 1-0(A 2 ). 

Therefore the incoming wave scatters with a probability 0(A 2 ). 

Thus up to time t, the particle encounters A 2 t collisions. In our scaling 



(5.10) 



n := Number of collisions ~At~A K — > oo. 
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The asymptotically infinite number of collisions is necessary to detect the diffusion (Brownian 
motion, heat equation), similarly as the Brownian motion arises from an increasing number 
of independent "kicks" (steps in random walk, see Theorem 12.7ft . 

6 Main result 

In this section we formulate our main result precisely, i.e. what we mean by that "Quantum 
Brownian motion conjecture" holds in the scaling limit up to times t ~ A _2_K . Before going 
into the precise formulation or the sketch of any proof, we will explain why it is a difficult 
problem. 

6.1 Why is this problem difficult? 

The dynamics of a single quantum particle among random scatterers (obstacles) is a multiple 
scattering process. The quantum wave function bumps into an obstacle along its evolution, 
and according to standard scattering theory, it decomposes into two parts: a wave that goes 
through the obstacle unnoticing it and a scattering wave (15. 9p . The scattered wave later 
bumps into another obstacle etc, giving rise to a complicated multiple scattering picture, 
similar to the one on Fig. [7J Obviously, the picture becomes more and more complicated as 
the number of collisions, n, increases. Finally, when we perform a measurement, we select 
a domain in space and we compute the wave function at the point. By the superposition 
principle of quantum mechanics, at that point all elementary scattering waves have to be 
added up; and eventually there exponentially many of them (in the parameter A -1 ). They 
are quantum waves, i.e. complex valued functions, so they must be added together with their 
phases. The cancellations in this sum due to the phases are essential even to get the right 
order of magnitude. Thus one way or another one must trace all elementary scattering waves 
with an enormous precision (the wavelength is order one in microscopic units but the waves 
travel at distance A -2 between two collisions!) 

It is important to emphasize that this system is not semiclassical. Despite the appear- 
ance of a small parameter, what matters is that the typical wavelength remains comparable 
with the spatial variation lengthscale of the potential. Thus the process remains quantum 
mechanical and cannot be described by any semiclassical method. 

6.2 Scales 

Since the collision rate is 0(A 2 ), and the typical velocity of the particle is unsealed (order 1), 
the typical distance between two collisions (the so-called mean free path) is L = 0(A~ 2 ) and 
the time elapsed between consecutive collisions is also 0(A -2 ). 

6.2.1 Kinetic scale 

In order to see some nontrivial effects of the collisions, we have to run the dynamics at least 
up to times of order A -2 . Within that time, the typical distance covered is also of order 
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Figure 7: Schematic picture of multiple scattering 

A -2 (always understood in microscopic coordinates). Thus the simplest scale that yields a 
nontrivial result is the so-called kinetic scaling, 

t = A" 2 T, x = \~ 2 X, n = \H = 0(1), 

i.e. the space and time are scaled by A 2 and the typical number of collisions X 2 t is of order 1. 

One convenient way to quantify Fig. [7] is that one works in a large quadratic box whose 
size is at least comparable with the mean free path (otherwise the boundary would have a too 
big effect). Thus one can consider the box 

A= [0,L} d 

and put |A| = L d obstacles in it, e.g. one at every lattice site. For defmiteness, we work with 
the Anderson model, i.e. 

H = -A x + A ^ v ^( x ~ oc). 

a&Z d 

Each elementary scattering wave corresponds to a particular (ordered) sequence of random ob- 
stacles. Since the total number of obstacles is L d , after n collisions we have ~ |A| n elementary 
waves to sum up in Fig. [7J thus 

A 

where A = (a±, . . . , ot n ) is a sequence of obstacle locations («j G Z d ) and ipA describes the 
elementary scattering wave that has undergone the collisions with obstacles at ai, 0:2, . . . at n 
in this order. The precise definition of ipA will come later. 
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These waves must be summed up together with their phase. It is easy to see from standard 
scattering theory that a spherical wave decays as A | distance |"( d_1 )/ 2 (the prefactor A comes 
from the fact that the total amplitude of the scattering is scaled down by a factor of A due 
to the coupling constant in front of the potential, see (I5.10j) ). Since the amplitudes multiply, 
and since the typical mean free path is A -2 , after n collisions the typical amplitude of ipA is 

d-l _ _ 

Thus if we try to sum up these waves in (16.11) with neglecting their phases, then 

IVul ~ |A| n A dn = (A~ M )V n = \- dn ->• oo. 

A 

However, if we can sum up these waves assuming that their phases are independent, i.e. they 
could be summed up as independent random variables, then it is the variance that is additive 
(exactly as in the central limit theorem in ( 12. ip ): 

A A 

Thus it is essential to extract a strong independence property from the phases. Since phases are 
determined by the random obstacles, there is some hope that at least most of the elementary 
waves are roughly independent. This will be our main technical achievement, although it will 
be formulated in a different language. 

We remark that in the physics literature, the independence of phases is often postulated as 
an Ansatz under the name of "random phase approximation" . Our result will mathematically 
justify this procedure. 

6.2.2 Diffusive scale 

Now we wish to go beyond the kinetic scale and describe a system with potentially infinitely 
many collisions. Thus we choose a longer time scale and we rescale the space appropriately. 
We obtain the following diffusive scaling: 

t = \~ K \- 2 T, x = \- k/2 \~ 2 T, n = \H = X K . 

Notice that the time is rescaled by an additional factor A _K compared with the kinetic scaling, 
and space is rescaled by the square root of this additional factor. This represents the idea that 
the model scales diffusively with respect to the units of the kinetic scale. The total number 
of collisions is still n = X 2 t, and now it tends to infinity. 

If we try to translate this scaling into the elementary wave picture, then first we have to 
choose a box that is larger than the largest space scale, i.e. L > \~ 2 ~ K I 2 . The total number 
of elementary waves to sum up is 

|Ap = {\- 2 -^) dn ~ A ^ K 
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i.e. superexponentially large. Even with the assumption that only the variances have to be 
summable (see (16. 2p ). we still get a superexponentially divergent sum: 



i^i 2 ^Ei^i 2 ~i A i nA2 



rid 



y2nd \— ndre/2 y^ — (const)\ 



1 2nd 

, \WA\ ~ KM < 

A 

We will have to perform a renormalization to prevent this blow-up. 



6.3 Kinetic scale: (linear) Boltzmann equation 

The main result on the kinetic scale is the following theorem: 

Theorem 6.1. /Boltzmann equation in the kinetic scaling limit/ Let the dimension be 
at least d > 2. Consider the random Schrddinger evolution on IR d or Z d 

idti> t = Hi> t , H = H X = ~A + XV(x), 

where the potential is spatially uncorrelated (see more precise conditions below). Consider the 
kinetic rescaling 

t = A" 2 T, x = \- 2 X, 

with setting e := A 2 to be the scale separation parameter in space. 

Then the weak limit of the expectation of the rescaled Wigner transform of the solution ip t 
exists, 

ew; t/s (x,v)^f t (x,v) 

and Ft satisfies the linear Boltzmann equation, 



(d T + Ve{V) ■ V x )f t (X, V) = J dUa(U, V) F T (X, U) - F r (X 



V) 



Here e(V) is the dispersion relation of the free kinetic energy operator given by e(V) = \V 2 
for the continuous model and by (15. 3p for the discrete model. The collision kernel o~(U, V) is 
explicitly computable and is model dependent (see below). 

The velocities U, V are interpreted as incoming and outgoing velocities, respectively, see 
Fig. O The collision kernel always contains an onshell condition, i.e. a term S(e(U) — e(V)) 
guaranteeing energy conservation (see the examples below). Thus the right hand side of the 
linear Boltzmann equation is exactly the generator of the jump process on a fixed energy shell 
(in case of e(V) = \V 2 it is on the sphere) as defined in Section [2T4l 

This theorem has been proven in various related models. 

• [Continuous model on M. d ] The random potential is chosen to be a homogeneous 
Gaussian field, this means that the random variable V(x) is Gaussian and it is stationary 
with respect to the space translations. We assume that E V(x) = 0, then the distribution 
of V(x) is uniquely determined by the two-point correlation function 

R(x-y) :=EV(x)V(y), 
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Figure 8: Incoming and outgoing momentum in a collision 

and we assume that R is a Schwartz function. The dispersion relation of the free Lapla- 
cian on R d is 

e(p) := ^P 2 , 

and the Boltzmann collision kernel is computed from the microscopic data as 

<r(U, V) = 5(e(U) - e(V))\R(U - V)\. (6.3) 

This result was first obtained by Spohn [IS] for short macroscopic time, T < T , and 
later extended to arbitrary times in a joint work with Yau [21] with a different method. 
We assume that the dimension d > 2, with a special non- degeneracy condition on the 
initial data in case of d = 2 forbidding concentration of mass at zero momentum. The 
d — 1 case is special; due to strong interference effects no Markovian limiting dynamics 
is expected. 

• [Discrete model + Non Gaussian] The method of [21] was extended by Chen in 
two different directions p3]. He considered the discrete model, i.e. the original Anderson 
model and he allowed non-Gaussian randomness as well. In the case of Laplacian on Z d 
the dispersion relation is more complicated (15. 3p . but the collision kernel is simpler; 

a(U,V) = 8(e(U)-e(V)), (6.4) 

i.e. it is simply the uniform measure on the energy surface given by the energy of the 
incoming velocity. 

• [Phonon model] In this model the random potential is replaced by a heat bath of 
non-interacting bosonic particles that interact only with the single electron. Formally, 
this model leads to an electron in a time dependent random potential, but the system 
is still Hamiltonian. This model is the quantum analogue of Einstein's picture. The 
precise formulation of the result is a bit more complicated, since the phonon bath has 
to be mathematically introduced, so we will not do it here. The interested reader can 
consult with [18]. More recently, De Roeck and Frohlich [10] have proved diffusion for 
any times in d > 4 dimensions with an additional strong spin coupling that enhances 
the loss of memory. 

• [Cubic Schrodinger equation with random initial data] A nonlinear version of 
the Boltzmann equation was proven in [39] where the Schrodinger equation contained a 
weak cubic nonlinearity and the initial data was drawn from near thermal equilibrium. 
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[Wave propagation in random medium] It is well known that in a system of harmon- 
ically coupled oscillators the wave propagate ballistically If the masses of the oscillators 
deviate randomly from a constant value, the wave propagation can change. The effect is 
similar to the disturbances in the free Schrodinger propagation; the ballistic transport is 
a coherence effect that can be destroyed by small perturbations. This model, following 
the technique of [21], has been worked out in [38J. 



• [Low density models] There is another way to reduce the effects of possible recollisions 
apart from introducing a small coupling constant: one can consider a low density of 
obstacles. Let Vo be a radially symmetric Schwartz function with a sufficiently small 
weighted Sobolev norm: 

|| (^^(V^VolL is sufficiently small for some N(d) sufficiently large. 

The Hamiltonian of the model is given by 

1 M 

H = — A x + 22 v o( x ~ x a) in a box [-L, L} d , L > e' 1 

a=l 

acting on M d , d > 3, where {x a } a= i t ^M denotes a collection of random i.i.d. points, 
with density 

M 

The kinetic scaling corresponds to choosing 

x ~ e^ 1 , t ~ e^ 1 , e — g, 

and then letting e — » 0. In this case convergence to the linear Boltzmann equation in the 
spirit of Theorem 16 . 1 1 was proven in [20], [17] (although these papers consider the d = 3 
case, the extension to higher dimensions is straightforward). The dispersion relation is 
e(p) = and the collision kernel is 

a(U, V) = \T(U, V)\ 2 5(U 2 - V 2 ) 
where T(U, V) is the quantum scattering cross section for — |A + Vo- 
lt is amusing the compare the weak-coupling and the low-density sceneries; they indeed 
describe different physical models (Fig. [9]). Although in both cases the result is the linear 
Boltzmann equation, the microscopic properties of the model influence the collision kernel. 
In particular, in the low density model the full single-bump scattering information is needed 
(while the function R (I6.3P in the weak coupling model can be viewed as the Born approxima- 
tion of the full scattering cross section). The scaling is chosen such that the typical number 
of collisions remains finite in the scaling limit. Note that in both models the wavelength is 
comparable with the spatial variation of the potential; both live on the microscopic scale. 
Thus neither model is semiclassical. 

We make one more interesting remark. When compared with the corresponding classical 
Hamiltonian, the low density model yields the (linear) Boltzmann equation both in classical 
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Figure 9: Low density and weak coupling sceneries 



and quantum mechanics (although the collision kernels are different). The weak coupling 
model yields Boltzmann equation when starting from quantum mechanics, however it yields 
a random walk on the energy shell (for the limiting velocity process v t ) when starting from 
classical mechanics [33] and 135] . For further references, see 121 1. 



Fig. HOI depicts schematically the microscopic and macroscopic scales in Theorem 16. II The 
right side of the picture is under a microscope which is "sees" scales comparable with the 
wavelength (Angstrom scale). Only one obstacle and only one outgoing wave are pictured. In 
reality the outgoing wave goes in all directions. On this scale the Schrodinger equation holds: 



id t ip t (x) 



-A x + XV(x) 



ipt(x) 



On the left side we zoomed our "microscope" out. Only a few obstacles are pictured that are 
touched by a single elementary wave function. All other trajectories are also possible. The 
final claim is that on this scale the dynamics can be described by the Boltzmann equation 

(d T + Ve{V) • V X ^F T {X, V) = J dUa{U, V) [f t {X, U) - F r {X, V) . 

Actually the obstacles (black dots) on the left picture are only fictitious: recall that there are 
no physical obstacles behind the Boltzmann equation. It represents a process, where physical 
obstacles are replaced by a stochastic dynamics: the particle has an exponential clock and it 
randomly decides to change its trajectory i.e. to behave as if there were an obstacle (recall 
Section [2.41) . It is crucial to make this small distinction because it is exactly the fictitious 
obstacles are the signatures that all recollisions have been eliminated hence this guarantees 
the Markov property for the limit dynamics. 
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Figure 11: Boltzmann semigroup models Schrodinger evolution 
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The main message of the result is that the long time (t = Te _1 ) Schrodinger evolution can 
modelled by a finite time (T) Boltzmann evolution on the macroscopic scale. Of course the 
detailed short scale information is lost, which explains irreversibility. The effective limiting 
equation is classical, but quantum features are retained in the collision kernel. 

Note that this approximation is not just conceptually interesting but it also gives rise to an 
enormous computational simplification. Solving the Schrodinger equation directly for a long 
time in an complicated environment is a computationally impossible task. In contrast, the 
Boltzmann equation needs to be solved on 0(1) times and there is no complicated environment. 
This is expressed schematically in the "commutative diagram" Fig. (TTJ The top horizontal line 
is the true (Schrodinger) evolution. The theorem says that one can "translate" the quantum 
initial data by Wigner transform and scaling limit into a phase space density /(O) = fo{X, V) 
and one can use the much simpler (and shorter time) Boltzmann evolution to /o to obtain the 
Wigner transform (after expectation and scaling limit) of the true evolution ip t . 

6.4 Diffusive scale: Heat equation 

We consider the same model with a weakly coupled random potential as before. For definite- 
ness, we work on the lattice Z d , d > 3, the Hamiltonian is given by 

H = -A + XV, 

where the discrete Laplacian is given by (15. 2 p with dispersion relation (15. 3 p and where the 
random potential is 

V(x) = v a 8(x — a). 

The random variables v a are i.i.d. with moments rrik =En* satisfying 

mi = m 3 = m 5 = 0, m 2( 2 < oo. 

Let 

i> t := e- itH iP , 
and we rescale the Wigner transform as before: 

W^^W; t =e- d W^ t {X/e,v). 

We remark that in case the discrete lattice Z d the definition of the Wigner function W q p(x,v) 
is given by (14.21) . Our main result is the following 

Theorem 6.2. [Quantum diffusion in the discrete case] For any dimension d > 3 there 
exists Ko(d) > [in d = 3 one can choose k (3) = YoSoo/> suc ^ that for any k < K (d) and any 
ipo ^ L 2 {Ij d ) the following holds. In the diffusive scaling 

t = X K X- 2 T, x = X- K/2 X- 2 X, e = X- K/2 - 2 , 

we have that 

/ E Wl {X, v) dv ->> f T (X, e) weakly as X -> 0, (6.5) 

J{e(v)=e} 
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and for almost all e > the limiting function satisfies the heat equation 



d T f T {X, e) = V x • D(e) V x f T (X, e) (6.6) 
with a diffusion matrix given by 

Dij(e) = ^ Ve{v) <S> Ve(t>)^> , (f(v)) e = Average of f on {v : e(v) = e] (6.7) 

and with initial state 

f (X,e) :=5(X) [ 5(e(v)-e)\Mv)\ 2 dv. 



The weak convergence in (16.101) means that for any Schwartz function J(x, v) on R d x R d we 
have 



e->-0 



lim / dX / dvJ(X,v)EW; (x - K - 2T) (X,v) = / dX / dvJ(X,v)f T (X,e(v)) 

(fZj J JRd, 



uniformly in T on [0, To], where T an arbitrary fixed number. 

This result is proven in The related continuous model is treated in [251 123] . A concise 
summary of the ideas can be found in the expository paper [22]. Note again that weak limit 
means that only macroscopic observables can be controlled. Moreover, the theorem does 
not keep track of the velocity any more, since on the largest diffusive scale the velocity of the 
particle is not well defined (similarly as the Brownian motion has no derivative). Thus we have 
to integrate out the velocity variable on a fixed energy shell (the energy remains macroscopic 
observable). This justifies the additional velocity integration in (16. 5 ]) . 

The diffusion matrix is the velocity autocorrelation matrix (computed in equilibrium) 
obtained from the Boltzmann equation 



POO 

D B oitz{e):= £ e [Ve(u(0))®Ve(v(*))],dt (6. 



similarly to Theorem 12 .31 Here S e denotes the expectation value of the Markov process v(t) 
described by the linear Boltzmann equation as its generator if the initial velocity v(0) is 
distributed according to the equilibrium measure of the jump process with generator (I2.10p 
on the energy surface {e(v) = e} with the Boltzmann collision kernel (16. 4p . Since this kernel 
is the uniform measure on the energy surface, the Boltzmann velocity process has no memory 
and thus it is possible to compute the velocity autocorrelation just by averaging with respect 
to the uniform measure on {e(v) = e}: 

D Bo it z (e) = £ e [Ve(«(0)) <g> Ve{v{t))]dt = ( Ve( v) <g) Ve(f) ) . (6.9) 

In particular, Theorem 16.21 states that the diffusion matrix D(e) obtained from the quantum 
evolution coincides with the velocity autocorrelation matrix of the linear Boltzmann equation. 
For completeness, we state the result also for the continuum model. 
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Theorem 6.3. [Quantum diffusion in the continuum case] Let the dimension be at least 
d > 3 and i(jq G L 2 (M. d ). Consider the diffusive scaling 

t = \- K \- 2 T, x = \- K/2 \- 2 X, e = \- K ' 2 - 2 . 

Let 

i>t = e^o, H = -~A + XV, 

where the random potential is given by (15. 5p with a single site profile B that is a spherically 
symmetric Schwartz function with in the support of its Fourier transform, G supp B. For 
d > 3 there exists Ko{d) > (in d = 3 one can choose kq(3) = 1/370,) such that for any 
k < n (d) we have 

/ E Wl(X,v) dv -± f T {X,e) (weakly as A -> 0), (6.10) 

J{e(v)=e} 

and for almost all energies e > the limiting function satisfies the heat equation 

d T f T (X,e) = D e A x f T (X,e) 

with initial state 

f (X,e) :=5(X) J 5(e(v)~e)\Mv)\ 2 dv. 
The diffusion coefficient is given by 

1 f 00 

D e :=~ d J o £e [v(0)-v(t)] dt, (6.11) 

where £ e is the expectation value for the random jump process on the energy surface {e(v) = e} 
with generator 

a{u, v) = \B{u — v)\ 2 5(e(u) — e(v)). 

The condition G supp B is not essential for the proof, but the statement of the theo- 
rem needs to be modified if the support is separated away from zero. In this case, the low 
momentum component of the initial wave function moves ballistically since the diameter of 
the energy surface is smaller than the minimal range of B. The rest of the wave function still 
moves diffusively. 

Fig. [12] shows schematically the three scales we discussed. Notice that going from the 
Boltzmann scale to the diffusive scale is essentially the same as going from random walk to 
Brownian motion in Wiener's construction (Theorems 12.31 and 12.71) . However it is misleading 
to try to prove Theorem 16.21 by a two-step limiting argument, first using the kinetic limit 
(Theorem 16.11) then combining it with Wiener's argument. There is only one limiting parame- 
ter in the problem, one cannot take first the kinetic limit, then the additional A~ K — > oo limit 
modelling the long kinetic time scale. The correct procedure is to consider the Schrodinger 
evolution and run it up to the diffusive time scale. In this way, the intermediate Boltzmann 
scale can give only a hint what to expect but it cannot be used for proof. 
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Diffusive scale: X, T 



Kinetic scale 



Atomic scale: x, t 




HEATEQ. ^ ^ BOLTZMANNEQ. SCHRODINGER EQ. 

Scaling limit of RW [Wiener] Quantum kinetic limit [Spohn, E-Yau] 




Quantum diffusive limit TE-Salmhofer-Yaul 



Figure 12: Three scales: diffusive, kinetic and atomic 
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In fact, this is not only a mathematical subtlety. Correlations that are small on the 
kinetic scale, and were neglected in the first limit, can become relevant on longer time scales. 
Theorem 16.21 shows that it is not the case; at least up to times t ~ A~ 2 ~ K the correlations 
are not strong enough to destroy the diffusive picture coming from the long time limit of the 
Boltzmann equation (in particular the diffusion coefficient can be correctly computed from 
the Boltzmann equation). However, this should not be taken for granted, and in fact it is not 
expected to hold in d = 2 dimensions for exponentially long times, t ~ exp(A _1 ). Theorem 16. II 
on the kinetic limit is valid in d = 2 as well and Wiener's argument is dimension independent. 
On the other hand if the diffusive evolution were true up to any time scales, then in particular 
the state would be delocalized, in contrast to the conjecture that the random Schrodinger 
operator in d = 2 dimensions is always localized. 

7 Feynman graphs (without repetition) 
7.1 Derivation of Feynman graphs 

Feynman graphs are extremely powerful graphical representations of perturbation expansions. 
They have been primarily used in many-body theories (they were invented for QED), but they 
are very convenient for our problem as well to organize perturbation expansion. They have 
been used by physicist to study random Schrodinger operators, a good overview from the 
physics point of view is |46j . 

We will derive the Feynman graphs we need for this presentation. Recall that 

H = -A + \V } V=J2 V ^ EV Q = 0, (7.1) 

where V a is a single-bump potential around a G Z d , for example 

V a (x) = v a 8(x — a). 

We consider the potential as a small perturbation of —A and we expand the unitary evolution 
by the identity (Duhamel formula) 

fa = e - itH ij = e itA ^j -iX [ e- i{t - s)H Ve lsA ^ ds. (7.2) 

Jo 

This identity can be seen by differentiating U(t) := e~ dH e %tH ° where H = H + \V, H = — A: 

W = e -itH(_ iH + iHo ytH = e -ttH^xV)e itHo , 

CI / 

thus we can integrate back from to t 

U(t) = I+ [ ds e- i{t - s)H (-t\V)e i(t - s)Ho 
Jo 

and multiply by e~ ttH ° from the right. 
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We can iterate the expansion (17. 2p by repeating the same identity for e l< -' S ^ H . For 
example, the next step is 

ft- f* S\ 

e - itH ip = e itA ^ -tX e i{t - s)A Ve isA ^ ds + X 2 / dsi / ds 2 e~ i{t - Sl - S2)H Ve iS2A Ve iSlA ilJo 

Jo Jo Jo 

etc. We see that at each step we obtain a new fully expanded term (second term), characterized 
by the absence of the unitary evolution of H; only free evolutions e~ lsA appear in these terms. 
There is always one last term that contains the full evolution and that will be estimated 
trivially after sufficient number of expansions. More precisely, we can expand 



N-l 



n=0 

where 

„ n 

^W(t) : = (-iX) n / ds d Sl . . .ds n 5(t - V s 8j 

J K + j=0 



e i S0 A Ve i S1 A v _ _ _ Ve iSnA tp 



and 

V N (t) := (-iA) / dse-^-^V^-^is). (7.4) 



Recalling that each V is a big summation ( 17.11) . we arrive at the following Duhamel formula 



N-l 



ipt = yj ^~]il>A + fuh evolution term, (7.5) 

n=0 A 

where the summation is over collision histories: 

A :=(oti, a 2 , a n ), «j G Z d , 
and each elementary wave function is given by 

i> A := Mt) = HA) n J <WG0 e lSoA V ai --- V a „e is " A Vo- (7-6) 
Here, for simplicity, we introduced the notation 

/ d/x„ )t (s) := / <$(t - ^Sj jds dsi . . . ds n (7.7) 

for the integration over the simplex So + s% + . . . + s n — t, Sj > 0. 

If the cardinality of A is n = \A\, then we say that the elementary wave function is of 
order n and their sum for a fixed n is denoted by 

tf< n >(t):= ^( t ). 

A : |A|=n 
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a, a„ a„ a, 

1 2 3 4 -is A 

Y A = • • • • with propagator: = e 



Figure 13: Representation of a wave function with collision history A 

Note that the full evolution term has a similar structure, but the leftmost unitary evolution 
e ls ° A is replaced with e~ ls ° H . 

We remark that not every elementary wave function has to be expanded up to the same 
order. The expansion can be adjusted at every step, i.e. depending on the past collision 
history, encoded in the sequence A, we can decice if an elementary wave function %pA should 
be expanded further or we stop the expansion. This will be essential to organize our expansion 
without overexpanding: once the collision history A indicates that is already small (e.g. 
it contains recollisions), we will not expand it further. 

Each ipA can be symbolically represented as shown on Fig. [131 where lines represent the 
free propagators and bullets represent the potential terms (collisions). Note the times between 
collisions is not recorded in the graphical picture since they are integrated out. 

We need to compute expectation values of quadratic functionals of if) like the Wigner 
transform. For simplicity we will show how the L 2 -norm can be treated, the Wigner transform 
is very similar (see next Section). 

Suppose for the moment that there is no repetition in A, i.e. all a/s are distinct. To 
compute E||^|| 2 = E tpip, we need to compute 

E E E e<^,^>. 

n,ri A :\A\=n B:\B\=ri 

This is zero unless B is a permutation of A, because individual potentials have zero expecta- 
tion. Higher order moments of the same potential V a are excluded since we assumed that A 
and B have no repetition. Thus the only nonzero contributions come from second moments, 
i.e. from pairings of each element of A with an element of B, in particular, \A\ = \B\ is forced, 
i.e. n = n'. 

In conclusion, with the tacit assumption that there are no repetitions, we can express 
E || ^|| 2 as a summation over the order n and over permutations between the sets A and B 
with \A\ = \B\ = n: 

EE E E(^ S ,^) = EE E e^.w^EE* ( 7 - 8 ) 

n,n' A :\A\=n B:\B\=n' n ir£S n A :\A\=n n 7rG5„ 

where S n is set of permutations on n elements and upper star denotes summation over non- 
repetitive sequences. Val(7r) is defined by the last formula by summing up all sequences A; 
this summation will be computed explicitly and it will give rise to delta functions among 
momenta. 

Drawing the two horizontal lines that represent ipA and ips parallel and connecting the 
paired bullets, we obtain Fig. HU where the first summation is over the order n, the second 
summation is over all permutations it e S n . a graph, called the Feynman graph of n. We will 
give an explicit formula for the value of each Feynman graph, here denoted by Val(7r). 
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=ZZVal(7c) 



71(2) 71(3) 7U(1) 



7U 



Figure 14: Representation of the L 2 -norm as sum of Feynman graphs 

7.2 L 2 -norm vs. Wigner transform 

Eventually we need to express 

E(J,W^) := J J(x,v)BW^(x,v)dxdv = J J(£, u)E W^, v)d£dv = E(J, W*) 

(recall that hat means Fourier transform only in the x variable). However, we have 

Lemma 7.1. For any Schwarz function J, the quadratic functional ip >->■ E(J, W^) is contin- 
uous in the L 2 -norm. 

Proof. Let ip,<f> G L? and set 

ft :=e((J,^>-(J,F?^>) =E J dtdvJ{t,v)\w^,v) -W^v) 

Let i;± = v ± | and write 



-0K)) + (^(u_) -0(u_))^(u+). 



Thus 



|Q|< y"d^|J(e,^|E|[.]| < J d£sup\J(Z,v)\ JdvE\[-} 



< j d£sup|J(^)| y dt;E^(^)|-|VK)-0K)| + |^(^-)-0(^)|-|0K)l 
By Schwarz inequality, 

| dvE | • \$(v + ) -?(«+) |) < ( | dt;E|^_)| 2 ) 1/2 ( | d V E|?( V+ ) - 0K)| 2 ) 

(tO| 2 ) V2 (/d^E|#;)-^)| 2 ) 



1/2 



dvEl 



1/2 



= (eiihi 2 ) 1/2 (e||^-0|| 2VI j 



(7.9) 



Thus 



|n| < ( J d^ sup |j(£, W ) i) (x/eIW + v^HF) (e||? - 0|| 2 ) V2 , 
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which completes the proof of the Lemma. □ 

This lemma, in particular, guarantees that it is sufficient to consider nice initial data, i.e. 
we can assume that ipo is a Schwarz function. More importantly, this lemma guarantees that 
it is sufficient to control the Feynman diagrammatic expansion of the L 2 -norm of ip; the same 
expansion for the Wigner transform will be automatically controlled. Thus all estimates of 
the error terms can be done on the level of the L 2 -norm; the more complicated arguments 
appearing in the Wigner transform are necessary only for the explicit computation of the main 
term. 



7.3 Stopping the expansion 



We will never work with infinite expansions, see ( I7.3p . but then we will have to control the last, 
fully expanded term. This will be done by using the unitarity of the full evolution e~^* _s ^ 
in the definition of ^n, see (I7.4p . More precisely, we have: 



Lemma 7.2 (Unitarity bound). Assume that 



1. According to ( 17. 3p . we write ip{t) 



&N(t) + ^N(t) with $/v = Y2 n =o containing the fully expanded terms. Th 



en 



E 



jv(t)/ 



< 1 



'E $ 



N 



sup E\\\V<4> 



(N-l), 



■,1/2 



0<s<t 



In other words, the difference between the true wave function ip(t) and its iV-th order 
approximation $tv can be expressed in terms of fully expanded quantities, but we have to 
pay an additional price t. This additional t factor will be compensated by a sufficiently 
long expansion, i.e. by choosing N large enough so that ■0( JV ~ :L )(s) is small. In practice we 
will combine this procedure by stopping the expansion for each elementary wave function ipA 
separately, depending on the collision history. Once ipA is sufficiently small to compensate for 
the t factor lost in the unitarity estimate, we can stop its expansion. 

Proof. We apply Lemma [7.11 with if) = ip(t) and = <3>at, so that ip — = ^n- Then we 
have, for any N, that 



E 



(J, W m ) - (J, W* N{t) )] < (l + veii^ii 2 ) (e||vM 2 ) 



1/2 



since 



1. Then using the unitarity, i.e. that 



-i(t-s)H - 



we can estimate 



EHtf 112 



N 



= E 
< E 



Jo 



ds\\\Vip 



(N-l), 



<t ds E\\\Vip 



(N-l), 



which proves Lemma [7.21 



(7.10) 
□ 
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Figure 15: Contribution of the ladder graph 



7.4 Outline of the proof of the Boltzmann equation with Feynman 
graphs 

The main observation is that among the n\ possible permutations of order n, only one, the 
identity permutation contributes to the limiting Boltzmann equation. Actually it has to be 
like that, since this is the only graph whose collision history can be interpreted classically, 
since here A = B as sets, so the two wavefunctions ipA and i^b visit the same obstacles in the 
same order. If there were a discrepancy between the order in which the obstacles in A and B 
are visited, then no classical collision history could be assigned to that pairing. 

The Feynman graph of the identity permutation is called the ladder graph, see Fig. (T5J Its 
value can be computed explicitly and it resembles to the n-th term in the Taylor series of the 
exponential function. 

The ladder with n pairings corresponds to a classical process with n collisions. If the 
collision rate is <tA 2 , then the classical probability of n collisions is given by the (unnormalized) 
Poisson distribution formula 

Prob (n collisions) = - - ^ . 

n\ 

This is clearly summable for the kinetic scale t ~ A~ 2 , and the typical number of collisions is 
the value n giving the largest term, i.e. n ~ X 2 t. The exponential damping factor e -^ m ( ) x2t 
in Fig. [15] comes from renormalization (see later in Section 18.41 where the constants a and 9 
will also be defined). 

We note that from the Duhamel formula leading to the Feynman graphs, it is not at 
all clear that the value of the ladder graphs is of order one (in the limiting parameter A). 
According to the Duhamel representation, they are highly oscillatory integrals of the form 
(written in momentum space) 



* r s \ r s n s / ft 



•i 



VaX(id n ) ~ A 2n (J e isip i j e ls ^ ...J )[J e~ is ^ j e~ is ^ ...J J. (7.11) 

Here id n is the identity permutation in S n which generates the ladder. If one estimates these 
integrals naively, neglecting all oscillations, one obtains 

\2nx2n 

ValK) < w , 

which is completely wrong, since it does not respect the power counting that A 2 t should be 
the relevant parameter. 
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Figure 16: Comparing the value of various crossing graphs 

Assuming that the ladder can be computed precisely, with all oscillations taken into ac- 
count, one still has to deal with the remaining n\ — 1 non-ladder graphs. Their explicit formulas 
will reveal that their values are small, but there are many of them, so it is not clear at all if 
they are still summable to something negligible. 

It is instructive to compare the value of the non-ladder graphs with the ladder graph 
because that is of order one (see Section 151)1 for a detailed computation). Fig. [TBI contains 
schematic estimates of simple crossing graphs that can be checked by direct computation (once 
the explicit formula for Val(7r) is established in the next section). It indicates that the value of 
a non-ladder graph is a positive power of A that is related with the combinatorial complexity 
of the permutation (e.g. the number of "crosses"). 

Accepting for the moment that all non-ladder graphs are at least by an order A 2 smaller 
than the ladder graphs, we obtain the following majorizing estimate for the summation of the 
terms in the Duhamel expansion: 

£ £ I Val(vr) | < £ i -^T 1 + ( n! - W 8 ^) ) • 

n tt n A=B A _ /zB 

This series clearly converges for short kinetic time (A 2 t = T < T ) but not for longer times, 
since the two n!'s cancel each other and the remaining series is geometric. This was the 
basic idea of Spohn's proof [15] and it clearly shows the limitation of the method to short 
macroscopic times. 

The source of the 1/n! is actually the time-ordered multiple time integration present in 
( 17. 7P ; the total volume of the simplex 

{(si, 8 2 ,..., s n ) : < Sl < . . . < s n < t} (7.12) 

is t n /n\. However, (17. lip contains two such multiple time integrations, but it turns out that 
only one of them is effective. The source of the competing n\ is the pairing that comes from 
taking the expectation, e.g. in case of Gaussian randomness this appears explicitly in the 
Wick theorem. The reason for this mechanism is the fact that in quantum mechanics the 
physical quantities are always quadratic in if), thus we have to pair two collision histories in 
all possible ways even if only one of them has a classical meaning. This additional n\ is very 
typical in all perturbative quantum mechanical argument. 
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This was only the treatment of the fully expanded terms, the error term in f lT.5 jl needs a 
separate estimate. As explained in details in Lemma 17. 2\ here we simply use the unitarity of 
the full evolution 

[ e l{t ' s)H [ Ve lSlA Ve lS2A ...d(i ntS {s) < t sup||^f||. (7.13) 



price 



In this estimate we lose a factor of t, but we end up with a fully expanded term ipf which can 
also be expanded into Feynman graphs. 

How to go beyond Spohn's argument that is valid only up to short macroscopic time? No- 
tice that "most" graphs have many "crosses" , hence they are expected to be much smaller due 
to phase decoherence than the naive "one-cross" estimate indicates. One can thus distinguish 
graphs with one, two and more crossings. From the graphs with more crossings, additional 
A 2 -factors can be obtained. On the other hand, the combinatorics of the graphs with a few 
crossings is much smaller than n\. 

On kinetic scale the following estimate suffices (here N denotes the threshold so that the 
Duhamel expansion is stopped after N collisions): 



N-l 




n=0 n ' ladder one cross rest 

unit.price ladder one cross two cross 

Finally, we can optimize 

N = N(X) ~ (logA)/(loglogA) 

to get the convergence. This estimate gives the kinetic (Boltzmann) limit for all fixed T 
(t = TA -2 ). This concludes the outline of the proof of Theorem 16.11 

For the proof of Theorem I6.2[ i.e. going to diffusive time scales, one needs to classify 
essentially all diagrams; it will not be enough to separate a few crosses and treat all other 
graphs identically. 



8 Key ideas of the proof of the diffusion (Theorems 16.2 
and [6131) 



We start with an apologetical remark. For pedagogical reasons, we will present the proof of a 
mixed model that actually does not exists. The reason is that the dispersion relation is simpler 
in the continuum case, but the random potential is simpler in the discrete case. The simplicity 
of the dispersion relation is related to the fact that in continuum the level sets of the dispersion 
relation, {v : e(v) — e}, are convex (spheres), thus several estimates are more effective. The 
simplicity of the random potential in the discrete case is obvious: no additional form-factor B 
and no additional Poisson point process needs to be treated. Moreover, the ultraviolet problem 
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is automatically absent, while in the continuous case the additional large momentum decay 
needs to be gained from the decay of B. Therefore, we will work in the discrete model (i.e. no 
form factors B, no Poisson points and no large momentum problem), and we will denote the 
dispersion relation in general by e(p). All formulas hold for both dispersion relations, except 
some estimates that will be commented on. The necessary modifications from this "fake" 
model to the two real models are technically sometimes laborous, but not fundamental. We 
will comment on them at the end of the proof. 

8.1 Stopping rules 

The Duhamel formula has the advantage that it can be stopped at different number of terms 
depending on the collision history of each elementary wave function. Thus the threshold N 
does not have to be chosen uniformly for each term in advance; looking at each term 

HA) n J e-' lSoH V ai --- V a „e is " A dfi n>t (s) 

separately, we can decide if we wish to expand e~ ls ° H further, or we rather use the unitary 
estimate (|7.13|) . The decision may depend on the sequence (ati, . . . , a n ). 

Set K := (X 2 t)X~ s (with some 5 > 0) as an absolute upper threshold for the number of 
collisions in an expanded term (notice that K is much larger typical number of collisions). We 
stop the expansion if we either have reached K expanded potential terms or we see a repeated 
a-label. This procedure has the advantage that repetition terms do not pile up. This stopping 
rule leads us to the following version of the Duhamel formula: 

K-l r t K 

= E <t + / e ~ i{t ' s)H (v&, + E ^Z) ds (s- 1 ) 

n=0 ^° n=0 

s v ' 

with CnV= E vw, E ^ ( 8 - 2 ) 

A : nonrep. \A\=n 

first rep. at a n 

Here ijj nr contains collision histories with no repetition, while the repetition terms ip rep contain 
exactly one repetition at the last obstacle, since once such a repetition is found, the expansion 
is stopped. In particular, the second sum above runs over A = («]_,... a n ) where a n = 
atj for some j < n and this is the first repetition. Actually the precise definition of the 
elementary wave functions in ( 18. 2ft is somewhat different from ( 17.61) (this fact is indicated by 
the stars), because the expansion starts with a potential term while if) a i n (17.61) starts with a 
free evolution. 

The backbone of our argument is the following three theorems: 
Theorem 8.1. [Error terms are negligible] We have the following estimate 

supE||Cl=o(t- 2 ). 

s<t 
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In particular, by the unitarity estimate this trivially implies that 

"t 2 
e - l{ t-s)H^err ds 



E 







0(1). 



(8.3) 

(\H)\- S = 0(\- K ~ s ) we have 

(8.4) 

^ = vui Wig yiu n ) t uyi). (8.5) 

Here Val^ig denotes the value of the Feynman graphs that obtained by expanding the 
Wigner transform instead of the L 2 -norm of tp t - Note that (I8.5P follows from (j8.4p by (the 
proof of) Lemma [7. II 

Theorem 8.3. [Wigner transform of the main term] The total contribution of the ladder 
terms up to K collisions, 



Theorem 8.2. [Only the ladder contributes] Forn < K 

Elixir = Val(id n ) +o(l) 
EW th nr = Val Wig {id n ) + o(l) 



A' 



Val 



Wig{id n ), 



n=0 



satisfies the heat equation. 

In the following sections we will focus on the non-repetition terms with n < K, i.e. on 
Theorem 18.21 We will only discuss the proof for the L 2 -norm ( 18 .4p . the proof of (I8.5P is a 
trivial modification. Theorem 18. II is a fairly long case by case study, but it essentially relies on 
the same ideas that are behind Theorem 18.21 Some of these cases are sketched in Section [10J 
Finally, Theorem 18. 3 1 is a non-trivial but fairly explicit calculation that we sketch in Section ITTl 



8.2 Feynman diagrams in the momentum-space formalism 

Now we express the value of a Feynman diagram explicitly in momentum representation. 
We recall the formula ( I7.6P and Fig. [13] for the representation of the collision history of an 
elementary wave function. We will express it in momentum space. This means that we assign 
a running momentum, po, pi, . . . , p n to each edge of the graph in Fig. [13] Note that we work on 
the lattice, so all momenta are on the torus, p G T d = [—it, 7r] d ; in the continuum model, the 
momenta would run over all M. d . The propagators thus become multiplications with e~ tSje ^ p ^ 
and the potentials become convolutions: 



(p) = / f[d Pj [ d^^e-^W^^-pije^^'tfe - p 2 ) . . . Mp n ) 

n n poo n i 

^.L.W Pi L da e -""n a - e(K)+ ,„ ^fe--^^^)- < 8 - 6 > 



rp=l/t"^ ' 3=1 

In the second step we used the identity 



„ n 

/ 8[t — Sj j dsodsi . . . ds r 



e -isoe(po)g-«sie(pi) e -is„e(p„) 



3=0 

it 



e ir]t / dae- iQt Y\ (8.7) 

' 7=1 a ~ e \Pi) + l V 
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that holds for any 77 > (we will choose rj = 1/t in the applications to neutralize the expo- 
nential prefactor). This identity can be obtained from writing 



3=0 

and integrating out the times. [Note that the letter a is used here and in the sequel as the 
conjugate variable to time, while previously <x, denoted obstacles. We hope this does not lead 
to confusion, unfortunately this convention was adopted in our papers as well.] 

The L 2 -norm is computed by doubling the formula (18.61) . We will use the letters Pi, ■ ■ - p n 
for the momenta in if) a and primed momenta for ipB'- 

eh</ch 2 = e|| ^f= E %.^>=EE Val W' 

A ■. nonrep A,B nonrcp n 7ren n 

u 



/CO " 
dad/3 e i{a ~P ]t \\ 
OO „•_ n 



| = o a - e (Pj) -*V P- e (Pj) + ir l 

n n 

X A 2 "]p(Vl -Pi) - (K(i)-l -p: (j) ))\MPn)m P n -p'jUdpjdp*. 

j=l j=0 

After the pairing, here we computed explicitly the pair expectations: 



E ]T V a (p)V a (q) = eMP ~ 9) = 6 (P ~ ?). (8-9) 

and we also used that only terms with B = tt(A) are nonzero. This formula holds for the 
simplest random potential (15 .4p in the discrete model; for the continuum model (I5.5P we have 



Ej2v a (p)V a (q) = \B(p)\ 2 5(p-q) (8.10) 

a 

According to the definition of in the continuum model (I5.6p . a labels the realizations of 
the Poisson point process in the last formula (I8.10p . 

The presence of the additional delta function, 5{p n — p' n ), in (18. 8 p is due to the fact 
that we compute the L 2 -norm. We remark that the analogous formula in the expansion for 
Wigner transform differs only in this factor; the arguments of the two wave functions in the 
momentum representation of the Wigner transform are shifted by a fixed value £ (14. 3p . thus 
the corresponding delta function will be 5{p n — p' n — £). 

A careful reader may notice that the non-repetition condition on A imposes a restriction 
on the summation in (18.91) ; in fact the summation for ctj is not over the whole Z d , but 
only for those elements that are distinct from the other a^s. These terms we will add to 
complete the sum (18 .9p and then we will estimate their effect separately. They correspond to 
higher order cumulants and their contribution is negligible, but technically they cause serious 
complications; essentially a complete cumulant expansion needs to be organized. We will 
shortly comment on them in Section [10.11 we will neglect this issue for the moment. 
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This calculation gives that 



2 



as it is pictured in Fig. [TU More concisely, 

Val(Tr) = A V / dpdp'dad/3 e u ^ TT -L 1 , A^(p, p') \Mp n )\ 

J j=o a ~ e ( p i> ~ l V P- e iPj) + IV 

(8.11) 

where p and p' stand for the collection of integration momenta and 

n 

A 7r (p,p') := 5(p n -p' n ) H<y((Pj-l-Pj) - (pjr(j)-l 

i=i 

contains the product of all delta functions. These delta functions can be obtained from the 
graph: they express the Kirchoff law at each pair of vertices, i.e. the signed sum of the four 
momenta attached to any paired vertices must be zero. It is one of the main advantages of the 
Feynman graph representation that the complicated structure of momentum delta functions 
can be easily read off from the graph. 

We remark that in ( 18. lip we omitted the integration domains; the momentum integrations 
runs through the momentum space, i.e. each pj and p'j is integrated over M. d or T d , depending 
whether we consider the continuum or the lattice model. The da and d/3 integrations always 
run through the reals. We will adopt this short notation in the future as well: 

//* fi Th rt fi 

dpdp'dad/3= / T\d Pj / TTdp'. / da / d/3. 
7( T d)n+l J(T d )"+ 1 ~Lq JR JR 

We will also often use momentum integrations without indicating their domains, which is 
always R d or T d . 



8.3 Lower order examples 

In this section we compute explicitly a few low order diagrams for the Wigner transform. 
Recalling that = 5^=0 V ,(n ^ represents the fully expanded terms and recalling the Wigner 
transform in momentum space (14. 3p . we can write the (rescaled) Wigner transform of $tv a s 
follows: 

N-1N-1 N-1 

n'=0 n=0 n,n'=0 

Set k n = v + |£ and k' n , := v — §£, then we can write 

/„ ri— 1 n 
J j=0 j=l 
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which can be doubled to obtain W% n , t . Notice that we shifted all integration variables, i.e. 
we use kj = Vj + |£. 

Case 1 : n — n' — 0. In this case there is no need for taking expectation and we have 



where Wfj is the Wigner transform of the initial wave function t/>q. Suppose, for simplicity, 
that the initial wave function is unsealed, i.e. it is supported near the origin in position space, 
uniformly in A. Then it is easy to show that its rescaled Wigner transform converges to the 
Dirac delta measure in position space: 



]xmWL(X,v)dXdv = 5(X)\ij (v)\ 2 dXdv 

since for any fixed £, 



Mh)MK) ^ \Mv)\ 2 (8-12) 

as e — > 0. In the sequel we will use the continuous dispersion relation e(k) = \k 2 for simplicity 
because it produces explicit formulas. In the general case, similar formulas are obtained by 
Taylor expanding e(k) in e up to the first order term and higher order terms are negligible 
but need to be estimated. 
Since 

1/ £ \ 2 1 / £ \ 2 
e(k ) -e(ko) = -\v- -^J - -\v + -£) = -ev ■ £, 

we get that in the kinetic limit, when X 2 t = T is fixed and e = A 2 

limy" J(Z,v)Wl n ,^,v)d£dv = J J(Z,v)\Mv)\ 2 e-* Tv <d£dv = J J(Tv,v)\Mv)\ 2 dv, 

by using dominated convergence and ( 18.121) . The last formula is the weak formulation of 
the evolution of initial phase space measure 5(X)\ipo(v)\ 2 dXdv under the free motion along 
straight lines. 

Case 2 : n = 0, n' — 1 or n = 1, n' = 0. Since there is one single potential in the expansion, 
these terms are zero after taking the expectation: 

Case 3 : n = n' = 1. We have 

Wf lt (£,v)=(-i\) f dse-^-^ [ dkoVih-^e-^Mko) 



x (zA) / ds' e l(t - s)e ^ dk'oVi^-k'^e^'^MK)- (8-13) 



The expectation value acts only on the potentials, and we have (using now the correct con- 
tinuum potential) 



EV(h - k )V(k[ - k' ) = \B(h - k )\ 2 5(k[ -k' - (h - k )). 
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After collecting the exponents, we obtain 

EW?,i,t(£,v) = A 2 JdkodKlBih - k )\ 2 S(k[ - K - (h - ho)) 

x e^^^^g^e^) - e(k ))Q t (e(k[) - e(A£))&(Wo(*&), 
where we introduced the function 

p ita i 

Qt(a) := 



HI 



Consider first the case £ = 0, i.e. when the Wigner transform becomes just the momentum 
space density. Then v = k\ = k[ and thus k = k' by the delta function, so e{ki) — e(k ) = 
e(k[) — e(k' ) and we get 

EWT^M = >? j dkolBih - fc )| 2 |^o(A:o)| 2 |^(e(A:i) -e^o))! 2 - 

Clearly 

r l \Q t {A)\ 2 2nS(A) 
as t — ?• oo (since J (^^) 2 dx = n), so we obtain that in the limit t — >■ oo, A 2 t = T fixed, 

E% il)t (0,i;)->T y dA; | J B(A; 1 -A;o)| 2 27r5(e(A; 1 )-e(A;o))|^o(^o)| 2 = r / dfco<7(*i, k )\Mk ) | 2 

(8.14) 

(recall fci = v in this case), where the collision kernel is given by 



a 



(h,k ) := \B(h - k Q )\ 2 2Ti5{e{k 1 ) -e(k )). 



A similar but somewhat more involved calculation gives the result for a general £, after 
testing it against a smooth function J. Then it is easier to use (I8.13P directly, and we get, 
after taking expectation and changing variables, 



EWZ iltt (£,v) = A 2 fds [ da' [ dv e l */ 2 \B(v-v )\ 2 W^ 
Jo Jo J 

with a total phase factor 

* := "(* " s)(y + |^) 2 -s(v + |e) 2 + (* - s')(v - "-if + s'(v - 



vo) 



(s - s'){v 2 - vl) - 2e [(* - - S -±^v 



After changing variables: b = s — s' and T = e ^4^-, we notice that the first summand gives 



dbe ib{v2 - v ^ = 2ti6(v 2 -v 2 ), (8.15) 
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and from the second one we have 



EW7 lM (e,«) = jf dT | d^ e-4( T - T o)^]V(^t; )iy o £ (e^o). 

These steps, especially (18.151) . can be made rigorous only if tested against a smooth function 
J, so we have 

E(J,W ltl , t ) -> J dT J dv j((T -T )v-T v , v y(v,v )\Mv )\ 2 . 

The dynamics described in the first variable of J is a free motion with velocity vq up to time 
Tq, then a fictitious collision happens that changes the velocity from Vq to v (and this process is 
given by the rate a(v, Vq)) and then free evolution continues during the remaining time T — T 
with velocity v. This is exactly the one "gain" collision term in the Boltzmann equation. 

Case 4 : n = 0, n' = 2 or n = 2, n' = 0. 

The calculation is similar to the previous case, we just record the result: 
EW a , ,*(e,«) = -A 2 *&(fci)?o(*i) e-*^" 8 ^) | dg| J §(A; 1 -g)| 2 ti?(t(e(A: 1 ) -e(g))), 
where 

u 

Simple calculation shows that tR(tu) — > 7i5(u) as t — > oo. Thus we have 

EW a ,o 1 t(e,«)->-|l&(«)| 2 e- fl,0 M«). 

where we defined 

a{v) := J a{v,q)dq. 

Similar result holds for Wo,2,t and combining it with (I8.14p . we obtain the conservation of the 
L 2 -norm up to second order in A, since with £ = we have 



dv 



EWi,i,t(0, v) + EW 2j0 ,t(0, v) + EW ,2,t(0, u) 



0. 



For general £ and after testing against a smooth function, we obtain 

W,W z ,o,t)^~ [ dvJ(Tv,v)a(v)\Mv)\ 2 



2 . 

which corresponds to the loss term in the Boltzmann equation. 

Apart from revealing how the Boltzmann equation emerges from the quantum expansion, 
the above calculation carries another important observation. Notice that terms with n ^ n' 
did give rise to non-negligible contributions despite the earlier statement ( I7.8J) that n = n' is 
forced by the non-repetition rule (which is correct) and that repetitive collision sequences are 
negligible (which is, apparently, not correct). The next section will explain this. 
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Figure 17: Schematic picture of the renormalization 



8.4 Self-energy renormalization 

The previous simple explicit calculations showed that not all repetition terms are neglible, in 
fact immediate recollisions are of order one and they - but only they - have to be treated 
differently. We say that a collision sequence (ai, a^, ■ • ■) has an immediate recollision if = 
for some i. This modification must also be implemented in the stopping rule, such that 
immediate recollisions do not qualify for recollisions for the purpose of collecting reasons to 
stop the expansion (see Section fBTTj) . 

Fortunately immediate repetitions appear very locally in the Feynman graph and they can 
be resummed according to the schematic picture on Fig. [T71 The net effect is that the free 
propagator e(p) = \y 2 needs to be changed to another propagator cu(p) that differs from e(p) 
by an 0(X 2 ) amount. More precisely, using the time independent formalism, second line of 
we notice that the first term in Fig. [T7] is represented by 

(8.16) 



a — e(p) — ir/ 



(which is also often called propagator). The second term carries a loop integration (momentum 
q), and its contribution is 

1 f X 2 \B(p - q)\ 2 ^ 



(a — e(p) — ir/) 2 J a — e(q) — ir] 

This formula is written for the continuum model and the integration domain is the whole 
momentum space M. d . For the lattice model B is absent and the integration is over T d . 

The third term carries two independent loop integration (momenta q and q'), and its 
contribution is 

X 2 \B(p-q)\ 2 dq f X 2 \B{p-q')\ 2 dq' 



(a — e(p) — ir]) 3 J a — e(q) — ir] J a — e(q') — ir] 

Setting 



® v (p,a) 



\B{p — q)\ 2 dq 



a — e(q) — ir] 

and noticing that due to the almost singularities of the (a — e(p) — irf)~ k prefactors the main 
contribution comes from a ~ e(p), we can set 

6(p) := lim e(p,e(p)). (8.17) 

r?->-0+ ' 
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(Similar, in fact easier formulas hold for the lattice model). Therefore, modulo negligible 
errors, the sum of the graphs in Fig. [T7] give rise to the following geometric series: 



1 



+ 



\ 2 9{p) 



+ 



(A 2 %)) : 



+ 



a — (e(p) + \ 2 9{p)) — ir] 



a — e(p) — ir] (a — e(p) — ir]) 2 (a — e(p) — ir]) 3 
This justifies to define the renormalized propagator 

w(p) := e(p) + A 2 %) 

and the above calculation indicates that all immediate recollisions can be taken into account 
by simply replacing e(p) with cu(p). This is in fact can be rigorously proved up to the leading 
order we are interested. 

An alternative way to see the renormalization is to reorganize how the original Hamiltonian 
is split into main and perturbation terms: 

H = e (p) + \ 2 6(p) +XV - X 2 9(p). 

S * ' 

The precise definition of the correction term 9(p) is determined by the following self-consistent 
equation: 

«&>)= 1 &q 



u(p) — u)(q) + iO ' 

the formula (18.171) is in fact only the solution to this equation up to order A 2 , but for our 
purposes such precision is sufficient. The imaginary part of 9 can also be computed as 



a(p) := rj 



dq 



\io(p) - Lo(q) + ir]\ 



-> lm9(p) r] ->■ 



and notice that it is not zero. In particular, the renormalization regularizes the propagator: 
while the trivial supremum bound on the original propagator is 



sup 

v 



< 



V 



a — e(p) — ir/ 

the similar bound on the renormalized propagator is much better: 



a — Lo{p) — ir/ A 2 + rj 



< 



i.li 



Strictly speaking, this bound does not hold if p ~ since lm9(p) vanishes at the origin, but 
such regime in the momentum space has a small volume, since it is given by an approximate 
point singularity, while the (almost) singularity manifold of ( 18.161) is large, it has codimension 



one. 



The bound ( 18. 18ft will play a crucial role in our estimates. Recall that due to the exponen- 
tial prefactor e 2r?< in (18.111) . eventually we will have to choose r] ~ 1/t. Thus in the diffusive 
scaling, when t 3> A -2 , the bound (I8.18P is a substantial improvement. 
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Figure 18: Ladder and antiladder 

The precise formulas are not particularly interesting; the main point is that after renor- 
malization: only the ladder has classical contribution and gives the limiting equation. If one 
uses the renormalized propagators, then one can assume that no immediate repetitions oc- 
cur in the expansion (in practice they do occur, but they are algebraically cancelled out by 
the renormalization of the propagator). Only after this renormalization will the value of the 
ladder graph given in Fig. [15] be correct with the exponential damping factor. From now on 
we will assume that the renormalization is performed, the propagator is u(p) and there is 
no immediate recollision. In particular, the statements of the key Theorems 18.11 and 18.21 are 
understood after this renormalization. 



8.5 Control of the crossing terms 

The key to the proof of Theorem 18.21 is a good classification of all Feynman diagrams based 
upon the complexity of the permutation tt : A — > B. This complexity is expressed by a 
degree d(rr) that is defined, temporarily, as follows. We point out that this is a little simplified 
definition, the final definition is a bit more involved and is given later in Definition 19.11 

Definition 8.4. Given a permutation tt G S n on {1, 2, . . . , n}, an index i is called ladder 
index if \ir(i) — tt{i — 1)| = 1 or — w(i + 1)| = 1. The degree of the permutation tt 

is defined as 

d(ir) = ^{non-ladder indices}. (8.19) 

Two examples are shown on Fig. [TSJ The top pictures show the pairing in the Feynman 
diagrams, the bottom pictures show the graph of the permutation as a function 

7i : {1,2, ...,n} ->■ {l,2,...,n}, 

where the set {1,2, ... ,n} is naturally embedded into the reals and we conveniently joined 
the discrete points of the graph of tt. The dotted region encircles the ladder indices: notice 
that a long antiladder also has many ladder indices. This indeed shows that it is not really the 
number of total crosses in the Feynman graph that is responsible for the smallness of the value 
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of the Feynman graph, rather the disordered structure and that is more precisely expressed 
by the non-ladder indices. 

The main philosophy is that most permutations have a high degree (combinatorial com- 
plexity). The key estimates are the following two lemmas. The first lemma estimates the 
number of permutations with a given degree; the proof is an elementary combinatorial count- 
ing and will not be presented here. 

Lemma 8.5. The number of permutations with a given degree is estimated as 

#{vr : d(7r) = 4 < (Cn) d . 

The second lemma is indeed the hard part of the proof; it claims that the value of the 
Feynman graph decreases polynomially (in A) as the degree increases. Thus we can gain a A 
factor per each non-ladder vertex of the Feynman graph. 

Lemma 8.6. There exists some positive k, depending on the dimension d, such that 

Val{n) < (CA) KdW . (8.20) 
Combining these two Lemmas, we can easily control the series Val(7r): 

oo 

^Val(7r) = ^ Val M = ^C d n d \ Kd < oo 

7TeSn d=0 TT:d{TT)=d d 

if n < K ~ A~ K (assume 5 = for simplicity). Since n ~ X 2 t, get convergence for t < cA~ 2 ~ K , 
i.e. the k from Lemma [8.61 determines the time scale for which our proof is valid. 

We remark that, although for technical reasons we can prove f)8.20p only for very small 
k, it should be valid up to k = 2 but not beyond. To see this, recall that the best possible 
estimate for a single Feynman graph is 0(X 2n ) and their total number is n\. Since 

A 2n n! « (AVT « (A 4 t) n , 

the summation over all Feynman graphs will diverge if t > A -4 . This means that this method is 
limited up to times t <C A -4 . Going beyond t ~ A -4 requires a second resummation procedure, 
namely the resummation of the so-called four-legged subdiagrams. In other words, one cannot 
afford to estimate each Feynman graph individually, a certain cancellation mechanism among 
them has to be found. Similar resummations have been done in many-body problems in 
euclidean (imaginary time) theories but not in real time. In the current problem it is not clear 
even on the intuitive level which diagrams cancel each other. 

8.6 An example 

In this very concrete example we indicate why the cross is smaller than the ladder by a factor 
A 2 , i.e. we justify the first estimate in Fig. [TBJ We recall that the value of a Feynman graph 
of order n is 

Val(vr) = A 2 V" /dpdp'dad/3 e u ^ f[ _/ . ^— - A^p, p') , (8.21) 
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Figure 19: Ladder and cross 



where the delta function comes from momentum conservation. Notice that the integrand is 
a function that are almost singular on the level sets of the dispersion relation, i.e. on the 
manifolds 



For the typical values of a and /3, these are manifolds of codimension d in the 2(n + l)d 
dimensional space of momenta. The singularities are regularized by the imaginary parts of 
the denominators, lmco(pj) + r], which are typically positive quantities of order 0(A 2 + 77). 
The main contribution to the integral comes from regimes of integration near these "almost 
singularity" manifolds. We remark that in the continuum model the momentum space extends 
to infinity, so in principle one has to control the integrals in the large momentum (ultraviolet) 
regime as well, but this is only a technicality. 

The main complication in evaluating and estimating the integral (18.211) comes from the 
delta functions in A 7r (p, p') since they may enhance singularity overlaps and may increase the 
value of the integral. Without these delta functions, each momentum integration dpj and dp'j 
could be performed independently and the total value of the Feynman graph would be very 
small, of order X 2n \ log \\ 2 ( n+1 \ It is exactly the overlap of these (almost) singularities, forced 
by the delta functions, that is responsible for the correct size of the integral. 

To illustrate this effect, we consider the simplest example for a cross and compare it with 
the direct pairing (ladder). The notations are found on Fig. [TH For simplicity, we assume 
that A 2 = 77, i.e. we are in the kinetic regime and the extra regularization coming from the 
renormalization, lm.8, is not important. 

In case of the ladder, the delta functions force all paired momenta to be exactly the same 
(assuming that po = p' Q since we compute the L 2 -norm of ip t ), i.e. 



a = Reu(pj) = e(pj) + X 2 Re9(pj), 



(3 = Reuip'j) = e(pj.) + A 2 Re0(p;.)- 




Val(ladder) = A 4 e 2r " / dad/3 e i{a ~ p)t [ 



J a — u(po) — irj a — uj(pi) — if] a — ufa) — ii] 



It 1 \ i • a / \ I • -n 1 \ , ■ dp dpidp 2 . (8.22) 
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Choosing rj = 1/t, with a simple analysis one can verify that the main contribution comes 
from the regime where \a — /3| < t~ x , thus effectively all singularities overlap (with a precision 
if). We have 

dp 1 A-. (8.23) 



| a — uj{p) + ir]\ 2 |Ima;| 

The intermediate relations are not completely correct as they stand because uj depends on the 
momentum and its imaginary part actually vanishes at p = 0. However, the volume of this 
region is small. Moreover, in the continuum case the ultraviolet regime needs attention as 
well, but in that case the decaying form factor |I?| 2 is also present. A more careful calculation 
shows that the final relation in ( I8.23|) is nevertheless correct. 
Thus, effectively, we have 

Val(Ladder) ~ A 4 (A" 2 ) 2 ~ 0(1) (8.24) 

modulo logarithmic corrections. Here (I8.23P has been used twice and the last pair of denomi- 
tors integrated out by dad/3 collecting a logarithmic term: 

/ 1 f\ • | = 0(1 lo g^D ( 8 -25) 

J \a - u)(po) - ir]\ 

(recall that rj — 1/t ~ A 2 ). This estimate again is not completely correct as it stands, because 
the integral in (18.251) is logarithmically divergent at infinity, but the ultraviolet regime is 
always trivial in this problem. Technically, for example, here one can save a little more a 
decay from the other denominators in (18.221) . Both inequalities (I8.23j) and (I8.25P hold both in 
the discrete and continuum case. In Appendix |A] we listed them and some more complicated 
related estimates more precisely that will also be used. 

As a rule of thumb, we should keep in mind that the main contribution in all our integrals 
come from the regimes where: 

(i) The two dual variables to the time are close with a precision 1/t, i.e. 

\a-P\< 1/t; 

(ii) The momenta are of order one and away from zero (i.e. there is no ultraviolet or infrared 
issue in this problem); 

(iii) The variables a, (3 are also of order one, i.e. there is no divergence at infinity for their 
integrations. 

By an alternative argument one can simply estimate all (but one) /3-denominators in f!8.22j) 
by an L°°-bound (I8.18p . i.e. by 0(A~ 2 ), then integrate out d/3 and then all a-denominators, 
using (18.251) and the similar L 1 -bound 

/ I r^-l = °(\^\) (8.26) 

J \a - u{p) - irj\ 

for the momentum integrals. The result is 

Val(Ladder) < A 4 (A~ 2 ) 2 (log A) 4 = (log A) 4 



64 



©(Pi) =a 



CO(p -pj+p 2 )=OC 




Figure 20: Overlap of the shifted level sets in the pi-space 

which coincides with the previous calculation f !8.24j) modulo logarithmic terms. In fact, one 
can verify that these are not only upper estimates for the ladder, but in fact the value of the 
n-th order ladder is 

Val(Ladder) ~ O n (l) (8.27) 
as A — > 0, modulo | log A| corrections. 

The similar calculation for the crossed diagram yields: 



Val(cross) = A / dad/3 e 



a — w(po) — irj a — u{jp\) — irj a — u{pi) — irj 

-dp dp 1 dp 2 . 



(3 - u(p ) +irj (3- cu(p -pi+P2) + iv P - w(p 2 ) + it] 

(8.28) 

Assuming again that the main contribution is from the regime where a ~ (with precision 
1/t), we notice that spherical singularities of the two middle denominators overlap only at a 
point singularity 

1 1 1 

dpi] r~, r~r ] 7 v r~r ~ ] ; 8.29 

| a - w(pi) - i7)\ \a — u)(p — pi +P2) +ir)\ \Po+P2\ + a 

(see also Fig. |20]and Lemma [A. 21 for a more precise estimate). In three dimensions the point 
singularity is harmless (will disappear by the next integration using (1A.51) from Lemma [A. 21) . 
and we thus obtain 

Val(cross) < A 2 Val(ladder) 

(modulo logarithms). 

We emphasize that inequality (18. 29ft in this form holds only for the continuous dispersion 
relation (see (IA.4j) in Appendix IA1 . i.e. if the level sets of the dispersion relation are convex, 



since it is relied on the fact that a convex set and its shifted copy overlap transversally, thus 
a small neighborhood of these sets (where \a — e(p)\ is small) have a small intersection. This 
is wrong for the level sets of the discrete dispersion relation which, for a certain range of a, is 
not a convex set (see Fig. I2TI) . However, ( 18.29P holds with an additional factor ?7~ 3 / 4 on the 
left hand side (see ( 1A.7I) ). which is a weaker estimate than in the continuous case but it is still 
useful because it is stronger than the trivial estimate 7]~ l obtained by by taking the L°°-norm 
of one propagator on the left hand side of (18.291) . 
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Figure 21: A level set of the discrete dispersion relation e(p) from (15. 3p 



9 Integration of general Feynman graphs 
9.1 Formulas and the definition of the degree 

We introduce a different encoding for the permutation tt by using permutation matrices. To 
follow the standard convention of labelling the rows and columns of a matrix by positive 
natural numbers, we will shift all indices in (18. lip by one, i.e. from now on we will work with 
the formula 

f n+1 

Val(Tr) = A 2 V* /dpdp'dad/3 e ft <«-# FT _ _ A^p, p') , (9.1) 

where 

n 

A 7r (p, p') := 5{p n+l - p' n+1 ) Yl $ (Oj+l - Pj) ~ (pJr(j)+l ~ PJr(j))) • 

i=i 

We introduce a convenient notation. For any {n + 1) X (n + 1) matrix M and for any vector 
of momenta p = (pi, . . .p n+1 ), we let Mp denote the following (n + l)-vector of momenta 

n+1 n+1 

m p : = ( E M «Pi> E M ^ • • • ) • (9 - 2) 
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XV/ XW 



These primed momenta are equal +p- + ... These primed momenta are equal -p. + ... 



Figure 22: Domain of dependencies of the momenta 

The permutation n e S n acting on the indices {1,2, ... ,n} in ( 19.11) will be encoded by an 
(n + 1) x (n + 1) matrix M(tt) defined as follows 

{1 if 7r(j - 1) < i < 7r(j) 

-1 if 5r(j) < i < - 1) (9.3) 

otherwise, 

where, by definition, tt is the extension of 7r to a permutation of {0, 1, . . . , n+1} by a(0) := 
and a(n + 1) := n + 1. In particular [Mp]i = p\, [Mp] n +i = p n +i- It is easy to check that 

n+1 

A.(p, P ')=n 5 (^-[ M p^) > ( 9 - 4 ) 

in other words, each p'-momentum can be expressed as a linear combination of p-momenta, 
the matrix M encodes the corresponding coefficients, and these are all the relations among 
the p and p 1 momenta that are enforced by A n . In particular, all p-momenta are independent. 

The rule to express p'-momenta in terms of p-momenta is transparent in the graphical 
representation of the Feynman graph: the momentum pj appears in those p^-momenta which 
fall into its "domain of dependence", i.e. the section between the image of the two endpoints 
of pj, and the sign depends on the ordering of these images (Fig. [22]). Notice that the roles of 
p and p' are symmetric, we could have expressed the p-momenta in terms of p'-momenta as 
well. It follows from this symmetry that M(7r _1 ) = [M(7r)] _1 and 

n+1 

A 7T (p,p') = l[5^p J -[M- 1 p'] J ) 

i=i 

also holds. 

A little linear algebra and combinatorics reveals that M is actually a totally unimodular 
matrix, which means that all its subdeterminants are or ±1. This will mean that the 
Jacobians of the necessary changes of variables are always controlled. 

The following definition is crucial. It establishes the necessary concepts to measure the 
complexity of a permutation. 
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Definition 9.1 (Valley, peak, slope and ladder). Given a permutation tt G S n let a be its 

extension. A point (j, vr(j)) ; j e 4 = {1,2,..., n}, on the graph of tt is called peak if < 
min{7f(j — l),vf(j + 1)}, it is called valley if > max{7r(j — l),7r(j + 1)}. Furthermore, 
if^U) —l£ — l),7r(j + 1)} an d (j, 7r(j)) is not a valley, then the point (j, j G I n , 

is called ladder. Finally, a point (j,ir(j)), j G I n , on the graph of it is called slope if it is 
not a peak, valley or ladder. 

Let I = {1, 2, . . . , n + 1} denote the set of row indices of M. This set is partitioned into 
five disjoint subsets, I = I p U I v U It U I s U Ii as t, such that Ii ast := {n + 1} is the last index, 
and i G I p ,I v ,Ii or I s depending on whether (7r _1 («), i) is a peak, valley, ladder or slope, 
respectively. The cardinalities of these sets are denoted by p := \I P \, v := \I V \, i := and 
s := |/ s |. The dependence on tt is indicated as p = p(tt) etc. if necessary. We define the 
degree of the permutation n as 

deg(vr) := d(n) := n - £(ir) . (9.5) 

Remarks: (i) The terminology of peak, valley, slope, ladder comes from the graph of the 
permutation tt drawn in a coordinate system where the axis of the dependent variable, 7r(j), 
is oriented downward (see Fig. |2"5|) . It immediately follows from the definition of the extension 
tt that the number of peaks and valleys is the same: 

p = v . 

By the partitioning of /, we also have 

p + v + £ + s + l = n + l. 

(ii) The nonzero entries in the matrix M(a) follow the same geometric pattern as the 
graph: each downward segment of the graph corresponds to a column with a few consecutive 
l's, upward segments correspond to columns with (— l)'s. These blocks of nonzero entries in 
each column will be called the tower of that column. On Fig. |23]we also pictured the towers 
of M{tt) as rectangles. 

(iii) Because our choice of orientation of the vertical axis follows the convention of labelling 
rows of a matrix, a peak is a local minimum of j — > 7r ( j ) . We fix the convention that the 
notions "higher" or "lower" for objects related to the vertical axis (e.g. row indices) always 
refer to the graphical picture. In particular the "bottom" or the "lowest element" of a tower 
is located in the row with the highest index. 

Also, a point on the graph of the function j — > Tr(j) is traditionally denoted by (j, 7r(j)), 
where the first coordinate j runs on the horizontal axis, while in the labelling of the {i,j)- 
matrix element My of a matrix M the first coordinate i labels rows, i.e. it runs vertically. To 
avoid confusion, we will always specify whether a double index (i, j) refers to a point on the 
graph of tt or a matrix element. 

(iv) We note that for the special case of the identity permutation tt = id = id n we have 
I p — I s — I v — 0, and Ii = {1, 2, . . . , n}. In particular, deg(zrf) = and deg(7r) > 2 for any 
other permutation tt ^ id. 

(v) Note that Definition 19.51 slightly differs from the preliminary definition of the degree 
given in (I8.19p . Not every index participating in a ladder are defined to be a ladder index; the 
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Figure 23: Graph of a permutation with the towers 



top index of the ladder is excluded (to avoid overcounting) and the bottom index of a ladder 
is also excluded if it is a valley, since the valleys will play a special role. The precise definition 
of the degree given in Definition 19.51 is not canonical, other variants are also possible; the 
essential point is that long ladders should reduce the degree. 



An example is shown on Fig. [23] with n = 8. The matrix corresponding to the permutation 
on this figure is the following (zero entries are left empty) 

A 



M(ir) 



-1 
-1 



V 



V 



1 

2 
3 
4 
5 
6 
7 
8 
9 



V 



v 
s 

(last) 



(9.6) 



The numbers on the right indicate the column indices and the letters show whether it is 
peak/slope/valley/ladder or last. In this case I p = {3}, I v = {7}, I s = {5, 8}. U = {1, 2, 4, 6}, 
hast = {9} and deg(cr) = 4. 
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9.2 Failed attempts to integrate out Feynman diagrams 



To estimate the value of a Feynman graph, based upon ( 19. 1 1) and (19.41) . our task is to succes- 
sively integrate out all p/s in the multiple integral of the form 



/„ n+l 
dad/3 / dp Yl 
J i=i 



1 



1 



\a - w(pi) -irj\ \f3- w(££i AfyPi) + ^| 

As usual, the unspecified domains of integrations for the a and f3 variables is R, and for the 
dp momentum variables is (T d ) n+1 . As one pj is integrated out, the matrix M needs to be 
updated and effective algorithm is needed to keep track of these changes. 

Why is it hard to compute or estimate such a quite explicit integral? The problem is that 
there is nowhere to start: each integration variable pj may appear in many denominators: 
apparently there is no "easy" integration variable at the beginning. 

As a first attempt, we can try to decouple the interdependent denominators by Schwarz or 
Holder inequalities. It turns out that they cannot be used effectively, since by their application 
we will lose the non-overlapping effects imposed by the crossing. If one tries to decouple these 
integrals trivially, one obtains the ladder and gains nothing. In contrast to a complicated 
crossing diagram, the ladder is easily computable because pi = p\ means that the integrals 
decouple: 



Q[M) < X 2n I dad/3 / dp 



1 



1 



a 



u(p n+1 )-irj\ \/3-u(p n+1 ) +irj\ 



(9.7) 



n 

i=i 



a 



"(Pi) -H 2 YT j= i MijPj) + iv I 



where M denotes the n x n upper minor of M and we used that M and M differ only with 
an entry 1 in the diagonal, so in particular Y^j=% ^n+ijPj — Pn+i and det(M) = det(M). 
Changing variables in the second term (p^ = Y^=i ^ijPj) an d using that the Jacobian is one 
(det(M) = ±1), we see that the first and second terms in the parenthesis are exactly the 
same. Thus 



Q(M) < 2X Zn / dad/3 / dp 



1 



1 



| a - u(p n +i) -iv\ \P- u(p n +i) + irj\ f = \ \ a ~ u (Pi) - ir ) 



n 



i 



and we can succesively integrate out the momenta Pi,P2, ■ ■ -Pn and then finally a, (3. Using 
the inequalities (I8.23j) and (I8.25p . we obtain (with the choice r) — 1/t) 

\Val(7r)\ < e 2rit Q{M) < CA 2n A" 2n (| logry) 2 ) = C{\ \ogr]\ 2 ) 

i.e. essentially the same estimate as the value of the ladder (j8.27p . 

As a second attempt, we can trivially estimate all (but one) /3-denominators, by their 
L°° norm 







+ if] 



< 



A" 



(9.8) 
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(see ( I8.18P and the remark afterward on its limitations), and then integrate out all pj one by 
one by using the Z^-bound (18.261) 

/ 1 TY-~l = 0(\\og V \). 

J \a - uj(p) + ir)\ 

This gives 

\Val(n)\< 0(| log r/r 2 ), 

i.e. it is essentially order 1 with logarithmic corrections. Note that this second method gives 
a worse exponent for the logarithm than the first one, nevertheless this method will be the 
good starting point. 

The second attempt is a standard integration procedure in diagrammatic perturbation 
theory. The basic situation is that given a large graph with momenta assigned to the edges, 
each edge carries a propagator, i.e. a function depending in this momenta, and momenta are 
subject to momentum conservation (Kirchoff law) at each vertex. The task is to integrate 
out all momenta. The idea is to use L°°-bounds for the propagator on certain edges in the 
graph to decouple the rest and use independent L 1 -bounds for the remaining integrations. 
Typically one constructs a spanning tree in the graph, then each additional edge creates a 
loop. The Kirchoff delta functions amount to expressing all tree momenta in terms of loop 
momenta in a trivial way. Then one uses L°°-bound on "tree" -propagators to free up all 
Kirchoff delta functions, integrate out the "tree" -variables freely and then use L 1 -bound on 
the "loop" -propagators that are now independent. 

This procedure can be used to obtain a rough bound on values of very general Feynman 
diagrams subject to Kirchoff laws. We will formalize this procedure in Appendix [Bl Since typ- 
ically L l and L°° bounds on the propagators scale with a different power of the key parameter 
of the problem (in our case A), we will call this method the power counting estimate. 

In our case simple graph theoretical counting shows that 

Number of tree momenta = number of loop momenta = n + 1. 

In fact, after identifying the paired vertices in the Feynman diagram (after closing the two 
long horizontal lines at the two ends), we obtain a new graph where one can easily see that the 
edges in the lower horizontal line, i.e. the edges corresponding ^-momenta, form a spanning 
tree and the edges of all pj-variables form loops. Thus the delta functions in (19 .4p represent 
the way how the tree momenta are expressed in terms of the loop momenta. 

Since each L°°-bound on "tree" -propagators costs A -2 by (19.81) . the total estimate would 
be of order 

^2n^-2(n+l) 

with actually logarithmic factors. But due to the additional a, (3 integrations, one L°°-bound 
can be saved (modulo logarithm) by using (18.251) . So the total estimate is, in fact, 

A 2 "A- 2 " = 0(1) 

modulo logarithmic factors, i.e. the same size as for the ladder diagrams. The conclusion is 
that even in the second attempt, with the systematic power counting estimate, we did not 
gain from the crossing structure of the permutation either. 
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9.3 The new algorithm to integrate out Feynman diagrams 

In our new algorithm, we use the L°°-bound on the propagators for a carefully selected sub- 
set of the "tree" -variables: the ones that lie "above the peak". The selection is shown in 
Fig. [21] for a concrete example. Notice that the segments in the horizontal axis correspond to 
Pi,P2, ■ ■ ■ ,Pn+i, i-e. the loop momenta, and the segments in the vertical axis correspond to 
the tree momenta. 

We will first explain the main idea, then we will work out a concrete example to illustrate 
the algorithm. After drawing the graph of the permutation, a structure of valleys and peaks 
emerges. By carefully examining the relation between this graph and the matrix M, one 
notices that if one estimates only those "tree" -propagators that lie above a peak by the trivial 
L°°-bound, then all the remaining propagators can be successively integrated out by selecting 
the integration variables pj in an appropriate order (dictated by the graph). Each integration 
involves no more than two propagators at one time, hence it can be estimated by elementary 
calculus (using estimates collected in Appendix lAl. 

In fact, this is correct if there are no ladder indices; but momenta corresponding to ladder 
indices can be integrated out locally (see later). As it turns out, ladder indices are neutral; 
their integration yields an 0(1) factor. The gain comes from non-ladder indices, and this will 
justify the definition of the degree (19.51) . 

More precisely, after an easy bookkeeping we notice that in this way we gain roughly as 
many A 2 factors as many slopes and valleys we have (no gain from peaks or ladders). Since 
the number of peaks and valleys are the same, 

p = v, 

and the peaks, valleys, slopes and ladders altogether sum up to n, 

p+v+£+s=n 

(see Remark (i) after Definition I9.ip . we see that 

1 1 

v + s = n — p — £ > -{n — £) = -d(n), 

since n — 2p — £ = n — p — v — £ = s > 0. Thus we gain at least . This would prove (18.201) 
with k = 1. After various technical estimates that were neglected here, the actual value of k 
is reduced, but it still remains positive and this completes the proof of Lemma 18.61 □ 

9.3.1 An example without ladder 

As an example, we will show the integration procedure for the graph on Fig. |2H This is 
a permutation which has no ladder index for simplicity, we will comment on ladder indices 
afterwards. The total integral is (for simplicity, we neglected the ±ir] regularization in the 
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Loop momenta 




Figure 24: Valleys and peaks determine the order of integration 
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formulas below to highlight the structure better) 

1111111 



A / dad/3dp 



a-u(pi)\ \a-u(p 2 )\ \a-u(p 3 )\ \a-u{p±)\ \a-u(p 5 )\ \a - u(p 6 ) \ \a-u(p 7 )\ 
111 1 



|/3 - u(pi)\ |/3 - u(pi -Pa+P5) \ \P ~ u(P2 -Pa+P5) \ 1/3 - u(P2 -Pa + P5 - Pe +Pr)\ 

x - - - . (9.9) 

|/3 - u(p 2 -pa +P5 -Pb +Pr)\ \P - u(p 2 - P3 + Pr) \ \P - u(p 7 )\ 

Notice that every variable appears at least in three denominators, for a graph of order n, 
typically every momentum appears in (const)n different denominators, so there is no way 
to start the integration procedure (an integral with many almost singular denominators is 
impossible to estimate directly with a sufficient precision). 

Now we estimate those /3-factors by L°°-norm (i.e. by A~ 2 according to (19.81) ) whose 
corresponding primed momenta lied right above a peak; in our case these are the factors 

1 1 



l£-w(pi)| \/3-u(p' 3 )\ 

i.e. the first and the third /3- factor in (I9.9p . After this estimate, they will be removed from 
the integrand. We will lose a factor (A~ 2 ) p = A -4 recalling that p = 2 is the number of peaks. 
We are left with the integral 

A 12 (A- 2 ) P / dad/3dp ' ' ' ' ' 1 



x 



a - u)(pi) \ \a - w(p 2 )\ \a - w(p 3 )| \a - w(p 4 )| \a - u(p 5 ) \ \a - uj(p e )\ 
1 1 1 

|a-w(p 7 )| |^-w(pi-p 4 +p B )| \P -u(p 2 -p 4 + P5-P6+P7)\ 



x 1 1 1 (9 10) 

|/3 - u(p 2 -p 3 +p 5 -p 6 +pr)\ |/3 - u(p 2 - p 3 + p 7 )\ \(3 - u(p 7 )\' 

Now the remaining factors can be integrated out by using the following generalized version of 
flEZSD (see Appendix |A]) 

f d P 1 ^ Ijgggl / n11 x 

a,/3 J \oi — U){p) + iri\\p -u(p + u) -vr]\ \u\ + A z 

Suppose we can forget about the possible point singularity, i.e. neglect the case when 
\u\ 1. Then the integral ( 19. lip is 0(1) modulo an irrelevant log factor. Then the successive 
integration is done according to graph: we will get rid of the factors \f3—u(p' 1 )\~ 1 , \f3—u(p' 2 )\^ 1 , 
|/3 — uijp^)^ 1 , etc., in this order. The factors with p[ and p 3 have already been removed, so 
the first nontrivial integration will eliminate 

' ' (9-12) 



Which integration variable to use? Notice that the point (1,2) was not a peak, that means 
that there is a momentum (in this case pi) such that p 2 is the primed momenta with the 
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largest index that still depends on p± (the "tower" of p± ends at p' 2 ). This means that among 
all the remaining /3-factors no other /3-factor, except for (I9.12p . involves p\\ Thus only two 
factors involve p\ and not more, so we can integrate out the p\ variable by using (19. lip 

1 1 

1 1 a - w(pi)| |/3 - w(pi -pa + Ps)\ 

(modulo logs and modulo the point singularity problem). 
In the next step we are then left with 



A 12 (A~ 2 ) P Jdad(3dp 



11111 

a - Lo(p 2 )\ \a - uj(p 3 )\ \at - w(pa)\ I" - u)(p 5 )\ \a - u){p & )\ 
1 1 



x 



X 



W-u{p 7 )\ |/3 - w(p 2 -p 4 +p 5 -p 6 +p 7 )\ 

1 1 1 

\/3 - U(p 2 ~Pz +P5 -P6 +Pt) \ \/3 - U(P2 ~P3 +Pr)\ \P-U(p 7 



Since we have already taken care of the /3-denominators with p'i,p' 2 , the next one would be 
the /3-denominator with p' 3 , but this was estimated trivially (and removed) at the beginning. 
So the next one to consider is 

1 1 



\P-u(p' 4 )\ |/3 -w(p2-Pa + P5-P6+Pt)\' 

Since p' 4 is not above a peak, there is a p-momentum whose tower has the lowest point at the 
level 4, namely pa- From the graph we thus conclude that p 4 appears only in this /3-factor 
(and in one a-factor), so again it can be integrated out: 



/ 



\a-uj(pi)\ |/3 - u{p 2 -Pa +P5 - Pe + P7)\ 



(modulo log's and point singularity). 
We have then 

A 12 (A~ 2 ) P f dad/3dp- 

J a - a 



1 1 1 



x 



a - Lo(p 2 )\ | a - w(p 3 )| | a - w(p 5 )| \a - to(p 6 )\ 
11 11 

\a-u(p 7 )\ |/3 -u(p 2 - pa + P5 - Pe +Pt)\ 1/3 - w(p 2 - P3 + Pt)\ \(3 - u(p 



7 



Next, 

1 1 



l^- w (P^)l I/ 3 ~ W G°2 ~P3 +P5 -P6 + P7>| 

includes even two variables (namely p$, pg) that do not appear in any other /3-denominators 
(because the towers of ps and p§ end at the level 5, in other words because right below the 
row of p' 5 there is a valley). We can freely choose which one to integrate out, say we choose 
P5, and perform 

dp51 7 — 77 77; 7 77 < Oil). 

I" - W(p 5 )| \P - U{p 2 -P3+P5-P6 + P7)\ 
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We are left with 

1 1 



\ 12 (\- 2 ) p /dad/3d Pl 

J a -a. 



-u(p 2 )\ \a-u(p 3 )\ \a-u(p 6 )\ 

x 1 1 1 (9 13) 

\a-oj(p 7 ) \ |/3 -oj{p 2 - p 3 +Pr)\ \P - w(pr)\' 



Finally 

1 1 



\P-u(jQ\ \P-uj(p 2 -p 3 + P7 )\ 
can be integrated out either by p 2 or p 3 (both towers end at the level of p' 6 ), e.g. 

dp 2 -, 7 — \TTn 7 \T — 0(1). 

\a - oj(p 2 ) \ \p - uj{p 2 -pa +P7)\ 

The last /3-factor is eliminated by the (3 integration by ( 18.251) and then in the remaining 
integral, 

A 12 (A~ 2 r / dad P] * , * T1 

7 |a - w(p 3 )| |a - w(pb)| \a - u(p 7 )\ 

one can integrate out each remaining momenta one by one. We have thus shown that the 
value of this permutation is 

Val(Tr) < A 12 (A~ 2 ) P = A 8 

modulo log's and point singularities. 
In general, the above procedure gives 

if there are no ladders, I = 0. 



9.3.2 General algorithm including ladder indices and other fine points 

Finally, we show how to deal with ladder indices. The idea is that first one has to integrate 
out the consecutive ladder indices after taking a trivial Schwarz inequality to decouple the 
a and /3-factors of the consecutive ladder indices and then proceed similarly to ( 19. 7]) . It is 
important that only propagators with ladder indices will be Schwarzed, for the rest of the 
integrand we will use the successive integration procedure explained in the previous section. 
In this way the ladder indices remain neutral for the final bookkeeping, in fact, the integral 

f ^ A- 2 

J \a — oo(p) — ir]\ 2 

exactly compensates the A 2 prefactor carried by the ladder index. Actually, one needs to take 
care that not only the A-powers but even the constants cancel each other as well, since an 
error C n would not be affordable when n, being the typical number of the collisions, is A~ K . 
Therefore the above bound will be improved to 

/ 1 n ■ i 2 dp = 1 + °( Al ~ 12K )> ( 9 - 14 ) 

J \a — uj(p) — t7]\ z 
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(see (1A.9P ). and it is this point where the careful choice of the remormalized propagator u(p) 
plays a crucial role. This is also one error estimate which further restricts the value of k. After 
integrating out the ladder indices, we perform the successive estimates done in Section 19.3.11 

There were two main issues have been swept under the rug (among several other less 
important ones...). First, there are several additional log factors floating around. In fact, 
they can be treated generously, since with this procedure we gain a 

X (const.)d(n) factor and the 

number of log factors is also comparable with d(n). However, here the key point again is that 
by the ladder integration in (I9.14p we do not lose any log factor; even not a constant factor! 

The second complication, the issue of the point singularities, is much more serious and this 
accounts for the serious reduction of the value of k in the final theorem. In higher dimensions, 
d > 3, one point singularity of the form (\p\ + 77) _1 is integrable, but it may happen that the 
same point singularity arises from different integrations of the form (19. lip along our algorithm. 
This would yield a high multiplicity point singularity whose integral is large. 

First, we indicate with an example that overlapping point singularities indeed do occur; 
they certainly would occur in the ladders, had we not integrated out the ladders separately. 

The structure of the integral for a set of consecutive ladder indices, {k, k+1, k+2, . . . k+m} 
is as follows: 

f 111 

= / dp fc dp fc+ i . . . dp fcH 



I a - uj(p k ) \ \a - u(p k+1 )\ " ' I a - uj(p k+m )\ 

1 1 1 . 

x . . . . (9.15) 

|/3 - u(p k + u)\ |/3 - u(p k+1 + u)\ " ' |/3 - u(p k+m + u)\' 

Here we used that if the consecutive ladders are the points (k, s), (k + 1, s + 1), (k + 2, s + 2), . . ., 
then the corresponding momenta are related as 

Pk ~ P' s = Pk+l ~ P' s+ 1 = Pk+2 - P' s+ 2 = ■■■ 

i.e. one can write p' s+i = p k+ i + u with a fixed vector u (that depends on all other momenta 
but not on p k ,p k +i, ■ ■ -p k+m , e.g. u = p 2 -Pi+P9)- 
Using (19. lip successively, we obtain 



n < 



Id + A 2 



m+l 



i.e. the same point singularity arises from each integration. We call this phenomenon accu- 
mulation of point singularities. Since u is a linear combination of other momenta, that need 
to be integrated out, at some point we would face with 

I \ m+l / \ m— 2 

dw ~ ( A~ 2 ) . (9.16) 



\u\ + X 



Since m+l consecutive ladder indices carry a factor A 2( - m+1 ' ) , we see that from a consecutive 
ladder sequence we might gain only A 6 , irrespective of the length of the sequence. It turns out 
that even this calculation is a bit too optimistic; the formula (I9.16P is only rough caricature, 
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also propagators depending on u are present in the u integration. In reality, eventually, the 
ladder indices are integrated out to 0(1). 

Based upon this example, one may think that the accumulation of point singularities occurs 
only in the presence of the consecutive ladders; once they are taken care of separately (and 
not gaining but also not losing from them), the remaining graph can be integated out without 
any accumulation of point singularities. We conjecture that this is indeed the case, but we 
could not quite prove this. Instead, we estimate several more /3-factors (propagators with 
"tree momenta" ) by the trivial L°°-bound to surely exclude this scenario. They are carefully 
chosen (see the concept of "uncovered slope indices" in Definition 10.3 of [23]) so that after 
their trivial removal, along the successive integration algorithm for the remaining propagators, 
indeed no accumulation of point singularity occurs. 

This completes the sketch of the proof of estimating the non-repetition Feynman diagrams, 
i.e. the proof of Theorem 18.21 □ 



10 Feynman graphs with repetitions 

In this short section we indicate how to prove Theorem 18.11 Recall that ip e s rr contains terms 
with many (n ^> A~ K ) potentials and terms with (not immediate) recollisions. 

The estimate of the terms with a large number of collisions is relatively easy; these are 
still non- repetition graphs, so the integration procedure from Section [9] applies. Permutations 
with degree d(ir) > C/k have a much smaller contribution than the required o(t~ 2 ) error. 
Permutations with low complexity contain macroscopically long ladders (i.e. ladders with 
length cn with some positive constant c) and these can be computed quite precisely. The 
precise calculation reveals a factor l/(cn)! due to the time-ordered integration (see discussion 
around (17. 12ft ) and the value of such graphs can be estimated by 



(cn)\ 

The combinatorics of such low complexity graphs is at most n c ^ K , so their total contribution 
is negligible if n ^> \ 2 t ~ A _K . 

The repetition terms from ?/>f rr require a much more laborous treatment. The different 
repetition patterns are shown on Fig. [25j When one of the repetitions shows up (apart from 
immediate repetition that were renormalized) , we stop the expansion to reduce complications 
(see (18. ip ). Actually the stopping rule is a bit more involved, because the repetition pattern 
has to collect sufficient "repetitions" to render that term sufficiently small even after paying 
the price for the unitary estimate, but we will no go into these details. Finally, each term 
we compute by "bare hand", after applying a certain graph surgery to reduce the number of 
different cases. One example of this reduction is shown on Fig. |26]while Fig. 1271 shows the result 
of an explicit estimate. The explicit estimates rely on the fact that the repetition imposes a 
certain momentum restriction that reduces the volume of the maximal overlap of singularities, 
reminiscent to the mechanism behind the crossing estimate, Section 18.61 Unfortunately, even 
after the graph surgery reduction, still a considerably number of similar but not identical cases 
have to be estimated on a case by case basis. 
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Figure 25: Various repetition patterns 
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Figure 26: Removal of a gate 
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Figure 27: Two sided recollision 
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Figure 28: Breaking up lumps 



10.1 Higher order pairings: the lumps 

We also mention how to deal with the higher order pairings that we postponed from Section 1872"! 
Recall that the stopping rule monitored whether we have a (non-immediate) repetition in the 
collision sequence A = (ax, a 2 , ■ ■ ■ , «„). This procedure has a side effect when computing the 
expectation: 

E J] % (p j+1 - Pj )V aj (q j+l -q j )= IJe^^-w-^-ft)] 

3 

The non- repetition restriction ae ^ au destroys the precise delta function J2 a eiap = $(p)- 
This requires to employ the following Connected Graph Formula that expresses the restricted 
sum as a linear combination of momentum delta functions: 

Lemma 10.1 (Connected Graph Formula). Let A n be the set of partitions of {1, ... ,n}. 
There exist explicit coefficients c(k) such that 

e* a '= £ II C (I^KE*) A=(A u A 2 ,...), 

where the summation is over all partitions A of the set {1, . . . , n}, and Ai, A 2 , . . . , denote the 
elements of a fixed partition, in particular they are disjoint sets and AiUA 2 U... = {1, . . . , n}. 

The appearence of nontrivial partitioning sets means that instead of pairs we really have 
to consider "hyperpairs" , i.e. subsets (called lumps) of size more than 2. 

Lumps have large combinatorics, but the value of the corresponding Feynman graph is 
small since in a large lump many more obstacle indices must coincide than in a pairing. Nev- 
ertheless, their treatment would require setting up a separate notation and run the algorithm 
of Section [9] for general lumps instead of pairs. 

To avoid these complications and repeating a similar proof, we invented a graph surgery 
that reduces lumps to pairs. The idea is that lumps are artificially broken up into pairs (i.e. 
permutations) and the corresponding Feynman graphs can be compared. The break-up is not 
unique, e.g. Fig. |2"51 shows two possible break-ups of a lump of four elements into two pairs. 

There are many break-ups possible, but the key idea is that we choose the break-up that 
gives the biggest d(7r). The following combinatorial statement shows a lower estimate on the 
maximal degree, demonstrating that every lump can be broken up into an appropriate pairing 
with relatively high degree. 
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Proposition 10.2. Let B = {-Bi, B2, . . .} be a partition of vertices. 

*(B):=i£){l S il : |Sil>4} 

TTien t/iere exists a permutation ir, compatible with B such that d(it) > |s(B). 

Using this lemma we reduce the estimate of hyperpairs to the previous case involving only 
pairs. This completes the very sketchy ideas of the proof of Theorem 18.11 □ 

11 Computation of the main term 

In this section we sketch the computation of the main term, i.e. the proof of Theorem 18.31 It 
is an explicit but nontrivial computation. The calculation below is valid only in the discrete 
case as it uses the fact that for any incoming velocity u, the collision kernel a(u, v) is constant 
in the outgoing velocity, i.e. the new velocity is independent of the old one (this was the 
reason why the diffusion coefficient could be directly computed (see (16. 9p ) without computing 
the velocity autocorrelation function via the Green-Kubo formula (16. lip as in the continuous 
case. 

The computation of the main term for the continuous case relates the diffusion coefficient 
to the underlying Boltzmann process. This procedure is conceptually more general but tech- 
nically somewhat more involved (see Section 6 of [21]), so here we present only the simpler 
method for the discrete model. 

The Fourier transform of the rescaled Wigner transform W(X/e, V) is given by 

where recall that e = \ 2+K / 2 is the space rescaling and t = \~ 2 ~ K T. 

We want to test the Wigner transform against a macroscopic observable, i.e. compute 

(0,EW t ) = (0(£,v),EWt(e£,v)) = J dvd£ 0(£,v) EW t {e£,v). 

Recall from Lemma [7. II that the Wigner transform enjoys the following continuity property: 

(0,E%) - {0,E%) < Cy/nW + E||0||VE||^-0|| 2 , 
in particular, by using (18. 3p . it is sufficient to compute the Wigner transform of 

K 
n=0 

We can express this Wigner transform in terms of Feynman diagrams, exactly as for the L 2 - 
calculation. The estimate (18. 5B in our key Theorem 18.21 implies, that only the ladder diagrams 
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matter (after renormalization) . Thus we have 

(O, EW t ) w ^ I dad/3 e it{a - p)+2tri 

k<K ^ R 



(11.1) 



k 

II [ / dv i R v ( a ' v j + y) ^ Wj ~ y 



2 



x 

i=2 



x y dD 1 E,(a^ 1 + ^,^ 1 -^ (£(,!) 1 ) 

with the renormalized propagator 



RJa,v) 



v ' ' a — e(v) — A 2 0(i>) + irj 

We perform each dvj integral. The key technical lemma is the following: 

Lemma 11.1. Let f(p) e C^R*) , a := (a + (3)/2, A 2+4k < 77 < X 2+K and fix r G R d with 
\r\ < \ 2+K / A . Then we have 

>^fiv) dv 

a — e(v — r) — X 2 9(v — r) — ir]j \J3 — e(v + r) — \ 2 6{v + r) + 



= -2iri 

where 



{a-f3) + 2(Ve)(v)-r-2t\ 2 l(a) { } 



1(a) := Im I — = / 5(e(v) — a) dv 

'a- e(v) - iO ' 



in particular, X(e(p)) = Im9(p). 

The proof of Lemma 111.11 relies on the following (approximate) identity 

1 1 



(a — g(v — r) — i0)((3 — g(v + r) + zO) a — (3 + g(v + r) — g(y — r) 

1 1 



.(5 — g(v) + iO a — g(v) — iO- 
and on careful Taylor expansions. □ 

Accepting this lemma, we change variables a = (a + (3)/2, b = (a — (3)/\ 2 and choose 
i] <C t^ 1 in flll.ip . Then we get 

(o,Ew t ) *y [ dedfldAe^fn / - 2 ^(^) W-;) dv . 
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with := W , F^ 1 ^ := O, and = 1, j ^ l,k + 1. Here W is the rescaled Wigner 
transform of the initial state. 

Let dfi a (v) be the normalized surface measure of the level surface S a := {e(v ) = a} defined 
via the co-area formula, i.e. the integral of any function h w.r.t. d/j, a (v) is given by 



(h) a := / h(v)d/i a (v) :-- 
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1(a) 



h{q) 



dm(q) 
|Ve(g)| 



where dm(q) is the Lebesgue measure on the surface S a . Often this measure is denoted by 
5(e(v ) — a)dv, i.e. 



h(v)dn a (v) 



7T 



1(a) 



h(v)5(e(v) — a)dv. 



Let H(v) := ^(J\ • Then we have 
(£>, EW t ) « 21(a) [d£ [ dadb e l2tx2x{a)b I 



fc+i 



n 



We expand the denominator up to second order 

—% 



iF^(C,' 



\J b + X~ 2 eH(vj) ■ £ — i 



dn a (v) 



b + e\- 2 H(v)-£-t 



11.2) 



dn a (v). 



After summation over k, and recalling that £ = 0(1) due the decay in the observable O, the 
effect of the last (error) term is K(e\~ 2 ) 3 = A -K (A K / 2 ) 3 = o(l), thus we can keep only the first 
three terms on the right hand side of (lll.2p . 

The linear term cancels out by symmetry: H(v) = —H(—v). To compute the quadratic 
term, we define 

D(a) := 41(a) J d^ a (v) H(v) <g> H(v), 
which is exactly the diffusion matrix (16.71) . Thus 



(0,EW t ) « Yl / d ^ / dfl 2X(a) / L?oK,^i)d/i a ( 
k<K J Jr J 



Vl ) / 0(£,v)dfi a (v) 



X 



d& e 2iA2l ( a )* fe 



6-2 



k+l 



e 2 \-\i,D(a)i) 1 
4X(a) (6 - i) 



n fc-i 



Setting 



B 2 := f^(^D(a)0 



41(a) 



the arising geometric series can be summed up: 

k+l 



2s \~b^i 



k=0 



B' 1 



(b-iy 



-i 



(b-if + B 2 



(b - if + i(b - i) 2 + iB 2 ' 
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We now perform the db integration analytically by using the formula (with A := 2A 2 X(a)): 



db e* Ab „ + f r= = 2rre~ tAB2 + o(l) 



{b - i) 3 + i(b - i) 2 + 1B 2 
from the dominant residue b = iB 2 . We can then compute 

tAB 2 = e 2 \-^ 1 -(Z,D(a)0 = ^(t,D{a)t). 

In particular, this formula shows that to get a nontrivial limit, e has to be chosen as e 
0(A _2_K//2 ), i.e. the diffusive scaling emerges from the calculation. Finally, we have 

(0,EW)n f d£ y dal(a)(J v)d/i (t;)) (W ) a exp ( - |<£, D(a)0 

where 

T 



f(T,£,a) := (Wo} a exp ( - -(^D(a)0 



is the solution of the heat equation ( 16. 6 p in Fourier space. This completes the sketch of the 
calculation of the main term and hence the proof of Theorem 18.31 □ 



12 Conclusions 

As a conclusion, we informally summarize the main achievements. 

(1) We rigorously proved diffusion from a Hamiltonian quantum dynamics in an environment 
of fixed time independent random scatterers in the weak coupling regime. The quantum 
dynamics is described up to a time scale t ~ A _2_K with some k > 0, i.e. well beyond 
the kinetic time scale. The typical number of collisions converges to infinity. 

(2) We identified the quantum diffusion constant (or matrix) and we showed that it co- 
incides with the diffusion constant (matrix) obtained from the long time evolution of 
the Boltzmann equation. This shows that the two-step limit (kinetic limit of quantum 
dynamics followed by the scaling limit of the random walk) yields the same result, up 
to the leading order, as the one-step limit (diffusive limit of quantum dynamics). 

(3) We controlled the interferences of random waves in a multiple scattering process with 
infinitely many collisions with random obstacles in the extended states regime. 

(4) In agreement with (2), we showed that quantum interferences and memory effects do 
not become relevant up to our scale. We remark that this is expected to hold for any k 
in d = 3, but not expected to hold for d = 2 [localization]. 

(5) As our main technical achievement, we classified and estimated Feynman graphs up to 
all orders. We gained an extra A-power per each non-ladder vertex compared with the 
usual power counting estimate relying on the L°° and L 1 -bounds for the propagators 
and on the tree and loop momenta. 
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A Inequalities 



In this appendix we collect the precise bounds on integrals of propagators that are used in the 
actual proofs. Their proofs are more or less elementary calculations, however the calculations 
are much easier for the continuous dispersion relation. For details, see the Appendix B of [23] 
for the continuous model and in Appendix A of [25] for the discrete case. 
We define the weighted Sobolev norm 

\\f\\m, n .= £ Mxrrmu 

\a\<n 

with (x) := (2 + x 2 ) 1 / 2 (here a is a multiindex). 

Lemma A.l (Continuous dispersion relation [24J). Let e(p) = \p 2 ■ Suppose that A 2 > rj > 
A 2+4k with k < 1/12. Then we have, 

\h(p-q)\dp < C\\h\\ 2d;0 | log A | log (a) 



and for < a < 1 



\a - w(p) + ir}\ ~ (a)V2(| g | - V /2H) 



l%-g)|dp < c a \\hh dfi v- 2{1 - a) (A2) 



|a - e(p) + ^| 2 "« " ( a )«/2(| g | - ^ja]) ' 

For a = and with h := 5 ; the following more precise estimate holds. There exists a constant 
Co, depending only on finitely many \\B\\}.^ norms, such that 

! fl% ( 7>l 2 ^ < 1 + G„A— [A + |a - «( 8 )H . (A.3) 

One can notice that an additional decaying factor h(p — q) must be present in the esti- 
mates due to the possible (but irrelevant) ultraviolet divergence. In the applications h = B. 
Moreover, an additional decaying factor in a was also saved, this will help to eliminate the 
logarithmic divergence of the integral of the type 

da ni 
I log ?7| 



| a — c + it] 



modulo the logarithmic divergence at infinity. 

The following statement estimates singularity overlaps: 

Lemma A. 2. Let dfi(p) = l(\p\ < C)dp, in applications £ ~ A~ K . For any \q\ < CA 



-i 



7i . = f dfijp) < C( d 3 \\ogr]\ 2 ^ 

J \a - e(p) + irj\ \f3 - e(p + q) + ir)\ ~ \\q\\ 

h . = f Mp) 1 < CVVV- 3 |logTy| 2 

J \a - e(p) + irj\ |/3 - e(p + q) + irj\ \\p — r|| _ 

uniformly in r,a,(3. Here := \q\ +7]. 
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To formulate the statement in the discrete case, we redefine 



III 9 III := V + mn i{|g — 7| : 7 is a critical point of e(p)} 

for any momentum q on the <i-dimensional torus. It is easy to see that there are 2 d critical 
points of the discrete dispersion relation e(p) = X^ =1 (l — cosp^). 

Lemma A. 3 (Discrete dispersion relation [2 5 J ) . The following bounds hold for the dispersion 
relation e(p) = X^=i(l — cosp^) in d > 3 dimensions. 

I ^ ~ ■ | I,, 1 I,, < c\ logr/l 3 , (A.6) 

\a — e{p) + »77 1 |||p — 7™ HI 

dp I < cr]- 3 ^\ log ?7 1 3 ^ ^ 



|a — e(p) + Z7y| |/3 — e(p + g) + 277 1 |||g|| 

^) := / 1 % ■ * ■ ,„, 1 „, < ^';°^' 3 (A.8) 

y | a — e[p) + ir)\ \p — e[p + q) + vr)\ \jp — r\\\ |||g||| 

uniformly in r, a, (3. With the (carefully) renormalized dispersion relation it also holds that 

sup / , ^ f . , 2 < 1 + C X^ (A.9) 

a J |a- uj(p) - vr}\ 1 

(compare with (1A.3jl ). 

Because the gain in the integrals flA.7j) . f ]A.8j) compared with the trivial estimate log 77 1 , 
is quite weak (compare with the much stronger bounds in Lemma IA.2l for the continuous case), 
we need one more inequality that contains four denominators. This special combination occurs 
in the estimates of the recollision terms. The proof of this inequality is considerably more 
involved than the above ones, see |19j. The complication is due to the lack of convexity of 
the level sets of the dispersion relation (see Fig. |2~T|) . We restrict our attention to the d = 3 
dimensional it turns out, this complication is less serious in higher dimensions. 

For any real number a we define 

I a I := min{|a|, \a — 2|, |a — 3|, \a — 4|, \a — 6|} (A. 10) 

in the d = 3 dimensional model. The values 0, 2, 4, 6 are the critical values of e(p). The value 
a = 3 is special, for which the level surface {e(p) = 3} has a flat point. In general, in d > 3 
dimensions, |||a|| is the minimum of \a — d\ and of all \a — 2m\, < m < d. 

Lemma A. 4. [Four Denominator Lemma fJE/] For any A > rj there exists C\ such that for 
any a G [0,6] with |Ja||| > A, 



/ 



dpdqdr < \ ogr] \ u 

a — e(p) + ir]\\a — e(q) + ir)\\a — e(r) + ir)\\at — e(p — q + r + u) + ir)\ ~ 

(All) 

uniformly in u. 
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The key idea behind the proof of Lemma IA.4I is to study the decay property of the Fourier 
transform of the level sets S a := {p : e(p) = a}, i.e. the quantity 

MO = / := / I J 7 ^ dm a{p), 

J* a h a |Ve(p)| 

where m a is the uniform (euclidean) surface measure on E . Defining 

e^dp 



/(f) - / 

we get by the coarea formula that 



a — e(p) + z?7 1 



1(0= ^^.(fl- 

From the convolution structure of X, we have 

X= / I(£)V^d£< / |/(0| 4 d^< ( f - ) 4 sup / \M0M- 

J J \J \a-a + irj\/ a J 

The first factor on the right hand side is of order | log ?7 1 4 , so the key question is the decay of 
the Fourier transform, jw a (£)' °f the level set, for large £. It is well known (and follows easily 
from stationary phase calculation) that for surfaces Scl 3 whose Gauss curvature is strictly 
separated away from zero (e.g. strictly convex surfaces), the Fourier transform 

MO := J e^dp 

decays as 

wei < § 

for large £. Thus for such surfaces the L 4 -norm of the Fourier transform is finite. For surfaces, 
where only one principal curvature is non- vanishing, the decay is weaker, |^| -1 ^ 2 . Our surface 
S a even has a flat point for a = 3. A detailed analysis shows that although the decay is not 
sufficient uniformly in all direction £, the exceptional directions and their neighborhoods are 
relatively small so that the L 4 -norm is still finite. The precise statement is more involved, 
here we flash up the main result. Let K = K{p) denote the Gauss curvature of the level set 
S a and let u(p) denote the outward normal direction at p 6 S a . 

Lemma A. 5 (Decay of the Fourier transform of S a , [19]). For v e S 2 and r > we have 

M TV ) ~ - + 



*/*\D(v)\V* + V 



where D(y) = mm,- \v(pj) — u\ where pj 's are finitely many points on the curve T = {K 
0} n S a; at which the neutral direction of the Gauss map is parallel with T. 
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This lemma provides sufficient decay for the L 4 -norm to be only logarithmically divergent, 
which can be combined with a truncation argument to obtain (1A.11I) . 



We stated all above estimates (with the exception of the precise ( 1A.3I) and ( 1A.9I) ) for 
the bare dispersion relation e(p). The modification of these estimates for the renormalized 
dispersion relation u(p) is straightforward if we are prepared to lose an additional r]~ K factor 
by using 

1 1 cA 2 1 + cAV 1 
l < 1 < • 

\a — u(p)+ir]\ \a — e(p) + ir]\ \a — e(p) + ir)\ \a — u(p) + ir/\ \a — e(p) + ir/\ 

B Power counting for integration of Feynman diagrams 

Lemma B.l. Let T be a connected oriented graph with set of vertices V(T) and set of edges 
E(T). Let N = \V(T)\ be the number of vertices and K = \E(T)\ be the number of edges. Let 
p e G M d denote a (momentum) variable associated with the edge e G -E'(r) and p denotes the 
collection {p e : e G E(T)}. Since the graph is connected, K > N — 1. Let R : R d — > C denote 
a function with finite L 1 and L°° norms, \\R\\i and ||-R||oo- For any vertex w G V(T), let 

Q w := / A w (p) Y[ R{Pe)dp e , 

J eeE{F) 

where 

a.(p) = n 5 ( e ( b - 12 ) 

vGV(F) e : e~v 

is a product of delta functions expressing Kirchoff laws at all vertices but w. Here e ~ v 
denotes the fact that the edge e is adjacent to the vertex v and ± indicates that the momenta 
have to be summed up according to the orientation of the edges with respect to the vertex. 
More precisely, if the edge e is outgoing with respect to the vertex v, then the sign is minus, 
otherwise it is plus. 

Then the integral fl w is independent of the choice of the special vertex w and the following 
bound holds 

Proof. First notice that the arguments of the delta functions for all v G ^(T), 

e : e~ti 

sum up to zero (every p e , e G E(T), appears exactly twice, once with plus once with mi- 
nus). This is the reason why one delta function had to be excluded from the product flB.12j) . 
otherwise they would not be independent. It is trivial linear algebra to see that, indepen- 
dently of the choice of w, all A w determine the same linear subspace in the space of momenta, 
(Rrf)^(D =R dK . 

Let T be a spanning tree in T and let T = E(T) denote the edges in T that belong to T. 
Let C = E(T) \ T denote the remaning edges, called "loops". Momenta associated with T, 
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{p e : e G T}, are called tree-momenta, the rest are called loop- momenta. Under Kirchoff 
law at the vertices, all tree momenta can be expressed in terms of linear combinations (with 
coefficients ±1) of the loop momenta: 

Pe = ^C r e i aPa, Ve G T, (B.13) 

where the coefficients a e ^ a can be determined as follows. Consider the graph TU{a}, i.e. adjoin 
the edge a to the spanning tree. This creates a single loop L a in the graph that involves some 
elements of T . We set a e ^ a = 1 if e G L a and the orientation of e and a within this loop 
coincide. We set a e>a = — 1 if the orientation of e and a are opposite, and finally a e ^ a = if 
e £ L a . 

It is easy to check that the N — 1 linear relations ( IB. 131) are equivalent to the ones deter- 
mined by the product ( IB. 121) of delta functions. For, if all tree momenta are defined as linear 
combinations of the loop momenta given by (1B.13|) . then the Kirchoff law is trivially satisfied. 



To check this, notice that when summing up the edge momenta at each vertex, only those 
loop momenta p a appear whose loop L a contains v. If v G L a , then p a appears exactly twice 
in the sum, once with plus and once with minus, since the momenta p a flowing along the loop 
L a once enters and once exits at the vertex v. Thus the relations (IB. 13[) imply the relations in 
( IB.12j) . and both sets of relations have the same cardinality, N — 1. Since the relations (IB.13j) 



are obviously independent (the tree momenta are all different), we obtain that both sets of 
relations determine the same subspace (of codimension d(N — 1)) of M. dK . 
Thus we can rewrite 



/ ( n R (pe) d Pz ) n 5 ( pe ~ a ^p°) ; 



in particular, Q w is independent of w. 

Now we estimate all factors R{p e ) with e G T by L°°-norm, then we integrate out all tree 
momenta and we are left with the L 1 -norms of the loop momenta: 

M < \\RC~ 1 [ (111^)1^) (II^-E^)^) = ll^l^ 1 !!^™ 

completing the proof of Lemma IB. 11 □ 



References 

[1] R. Adami, L. Erdos, Rate of decoherence for an electron weakly coupled to a phonon 
gas. J. Statis. Physics 132, no. 2, 301-328 (2008) 

[2] M. Aizenman and S. Molchanov, Localization at large disorder and at extreme ener- 
gies: an elementary derivation, Commun. Math. Phys. 157, 245-278 (1993) 

[3] M. Aizenman, R. Sims, S. Warzel, Absolutely continuous spectra of quantum tree 
graphs with weak disorder. Comm. Math. Phys. 264, 371-389 (2006) 



89 



[4] P. Anderson, Absences of diffusion in certain random lattices, Phys. Rev. 109, 1492- 
1505 (1958) 

[5] D. Benedetto, F. Castella, R. Esposito and M. Pulvirenti, From the N-body 
Schrodinger equation to the quantum Boltzmann equation: a term-by-term conver- 
gence result in the weak coupling regime. Commun. Math. Phys. 277, 144 (2008) 

[6] R. Brown, Philosophical Magazine N. S. 4 161-173 (1828), and 6 161-166 (1829) 

[7] J. Bourgain, Random lattice Schrodinger operators with decaying potential: some 
higher dimensional phenomena. Lecture Notes in Mathematics, Vol. 1807, 70-99 
(2003). 

[8] L. Bunimovich, Y. Sinai: Statistical properties of Lorentz gas with periodic configu- 
ration of scatterers. Commun. Math. Phys. 78 no. 4, 479-497 (1980/81), 

[9] T. Chen, Localization Lengths and Boltzmann Limit for the Anderson Model at Small 
Disorders in Dimension 3. J. Stat. Phys. 120, 279-337 (2005) 

[10] W. De Roeck, J. Frohlich: Diffusion of a massive quantum particle coupled to a quasi- 
free thermal medium in dimension d>4. Preprint, http://arxiv.org/abs/0906.5178 

[11] W. De Roeck, J. Frohlich, A. Pizzo: Quantum Brownian motion in a simple model 
system. Comm. Math. Phys. 293 (2010), no. 2, 361-398. 

[12] Disertori, M., Spencer, T.: Anderson localization for a supersymmetric sigma model. 
Preprint larXiv:0910.3~325l 

[13] Disertori, M., Spencer, T., Zirnbauer, M.: Quasi- diffusion in a 3D Supersymmetric 
Hyperbolic Sigma Model. Preprint arXi v:0901. 1652, 

[14] D. Diirr, S. Goldstein, J. Lebowitz: A mechanical model of Brownian motion. Com- 
mun. Math. Phys. 78 (1980/81) no. 4, 507-530. 

[15] A. Einstein, Uber die von der molekularkinetischen Theorie der Warme geforderte 
Bewegung von in ruhenden Fliissigkeiten suspendierten Teilchen, Ann. der Physik, 
17, 549-560 (1905), and: Zur Theorie der Brownschen Bewegung, Ann. der Physik, 
19 180 (1906). 

[16] K. Efetov. Supersymmetry in Disorder and Chaos, Cambridge University Press, 1999. 

[17] D. Eng and L. Erdos, The Linear Boltzmann Equation as the Low Density Limit of 
a Random Schrodinger Equation. Rev. Math. Phys, 17, No. 6, 669-743 (2005) 

[18] L. Erdos, Linear Boltzmann equation as the long time dynamics of an electron weakly 
coupled to a phonon field. J. Stat. Phys., 107, 1043-1128 (2002) 

[19] L. Erdos, M. Salmhofer, Decay of the Fourier transform of surfaces with vanishing 
curvature. Math. Z. 257 no 2., 261-294 (2007) 



90 



[20] L. Erdos and H.-T. Yau, Linear Boltzmann equation as scaling limit of quantum 
Lorentz gas. Advances in Differential Equations and Mathematical Physics. Con- 
temporary Mathematics 217, 137-155 (1998). 

[21] L. Erdos and H.-T. Yau, Linear Boltzmann equation as the weak coupling limit of the 
random Schrodinger equation, Commun. Pure Appl. Math. LIII, 667-735, (2000). 

[22] L. Erdos, M. Salmhofer and H.-T. Yau, Towards the quantum Brownian motion. 
Lecture Notes in Physics, 690, Mathematical Physics of Quantum Mechanics, Se- 
lected and Refereed Lectures from QMath9. Eds. Joachim Asch and Alain Joye. pp. 
233-258 (2006) 

[23] L. Erdos, M. Salmhofer, H.-T. Yau, Quantum diffusion of the random Schrodinger 
evolution in the scaling limit. Acta Math. 200, no. 2, 211-277 (2008) 

[24] L. Erdos, M. Salmhofer, H.-T. Yau, Quantum diffusion of the random Schrodinger 
evolution in the scaling limit II. The recollision diagrams. Commun. Math. Phys. 
271, 1-53 (2007) 

[25] L. Erdos, M. Salmhofer, H.-T. Yau, Quantum diffusion for the Anderson model in 
scaling limit. Ann. Inst. H. Poincare 8, 621-685 (2007) 

[26] R. Froese, D. Hasler, W. Spitzer, Absolutely continuous spectrum for the Anderson 
model on a tree: a geometric proof of Klein's theorem. Comm. Math. Phys. 269, 
239-257 (2007) 

[27] J. Frohlich, F. Martinelli, E. Scoppola and T. Spencer, Constructive proof of local- 
ization in the Anderson tight binding model. Comm. Math. Phys. 101, no. 1, 21-46 
(1985) 

[28] J. Frohlich and T. Spencer, Absence of diffusion in the Anderson tight binding model 
for large disorder or low energy, Commun. Math. Phys. 88, 151-184 (1983) 

[29] F. Germinet, A. Klein, J. Schenker: Dynamical derealization in random Landau 
Hamiltonians. Ann. of Math. 166, 215-244 (2007) 

[30] T. G. Ho, L. J. Landau and A. J. Wilkins: On the weak coupling limit for a Fermi 
gas in a random potential. Rev. Math. Phys. 5 (1993), no. 2, 209-298. 

[31] I. Ya. Goldsheid, S. A. Molchanov and L. A. Pastur, A pure point spectrum of the 
one dimensional Schrodinger operator. Funct. Anal. Appl. 11, 1-10 (1997) 

[32] Y. Kang, J. Schenker: Diffusion of wave packets in a Markov random potential. J. 
Stat. Phys. 134 (2009), 1005-1022. 

[33] H. Kesten, G. Papanicolaou: A limit theorem for stochastic acceleration. Comm. 
Math. Phys. 78 19-63. (1980/81) 

[34] A. Klein, Absolutely continuous spectrum in the Anderson model on the Bethe lattice, 
Math. Res. Lett. 1, 399-407 (1994) 



91 



[35] T. Komorowski, L. Ryzhik: Diffusion in a weakly random Hamiltonian flow. Comm. 
Math. Phys. 263, 277-323 (2006) 

[36] O. Lanford, On the derivation of the Boltzmann equation. Asterisque 40, 117-137 
(1976) 

[37] P. A. Lee, T. V. Ramakrishnan, Disordered electronic systems. Rev. Mod. Phys. 57, 
287-337 (1985) 

[38] J. Lukkarinen and H. Spohn, Kinetic limit for wave propagation in a random medium. 
Arch. Ration. Mech. Anal. 183, 93-162 (2007) 

[39] J. Lukkarinen and H. Spohn, Weakly nonlinear Schrddinger equation with random 
initial data. Preprint larXiv:0901. 3283 

[40] P. Markowich and C. Ringhofer, Analysis of the quantum Liouville equation. Z. 
Angew. Math. Mech, 69, 121-129 (1989). 

[41] J. B. Perrin, Mouvement brownien et realite moleculaire. Annales de chimie et de 
physiqe, VIII 18, 5-114 (1909). 

[42] C.-A. Pillet, Some results on the quantum dynamics of a particle in a Markovian 
potential. Comm. Math. Phys. 102 (1985), no. 2, 237254. 

[43] B. Simon and T. Wolff, Singular continuous spectrum under rank one perturbations 
and localization for random Hamiltonians. Comm. Pure Appl. Math. 39(1), 75-90 
(1986) 

[44] M. von Smoluchowski, Zur kinetischen Theorie der Bwownschen Molekularbewegung 
und der Suspensionen. Ann. der Physik, 21, 756-780 (1906). 

[45] H. Spohn: Derivation of the transport equation for electrons moving through random 
impurities. J. Statist. Phys. 17 (1977), no. 6., 385-412. 

[46] D. Vollhardt, P. Wolffe, Diagrammatic, self- consistent treatment of the Anderson 
localization problem in d < 2 dimensions. Phys. Rev. B 22, 4666-4679 (1980) 

[47] N. Wiener, Differential space. J. Math and Phys. 58, 131-174 (1923) 



92 



